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ABSTRACT 


This  paper  reviews  the  state  of  the  art  in  enumerative  solution  methods 
for  the  traveling  salesman  problem  (TSP).  The  introduction  (Section  1) 
discusses  the  main  ingredients  of  branch  and  bound  methods  for  the  TSP. 
Sections  2,  3  and  4  discuss  classes  of  methods  based  on  three  different  re¬ 
laxations  of  the  TSP:  the  assignment  problem  with  the  TSP  cost  function,  the 
1-tree  problem  with  a  Lagrangean  objective  function,  and  the  assignment 
problem  with  a  Lagrangean  objective  function.  Section  5  briefly  reviews  some 
other  relaxations  of  the  TSP,  while  Section  6  discusses  the  performance  of 
some  state  of  the  art  computer  codes.  Besides  material  from  the  literature, 
the  paper  also  includes  the  results  and  statistical  analysis  of  some  computa¬ 
tional  experiments  designed  for  the  purposes  of  this  review. 
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1.  Introduction 


Since  che  firsc  attempc  to  solve  traveling  salesman  problems  by  an  enuner- 
acive  approach,  apparently  due  to  Eastman  [1958],  many  such  procedures  have  been 
proposed.  In  a  sense  the  TSP  has  served  as  a  testing  ground  for  the  development 
of  solution  methods  for  discrete  optimization,  in  that  many  procedures  and  devices 
were  first  developed  for  the  TSP  and  then,  after  successful  testing,  extended  to 
more  general  integer  programs.  The  term  "branch  and  bound"  itself  was  coined  by 
Little,  Murty,  Sweeney  and  Karel  [1963]  in  conjunction  with  their  TSP  algorithm. 

Enumerative  (branch  and  bound,  implicit  enumeration)  methods  solve  a  dis¬ 
crete  optimization  problem  by  breaking  up  its  feasible  set  into  successively 
smaller  subsets,  calculating  bounds  on  the  objective  function  value  over  each 
subset,  and  using  them  to  discard  certain  subsets  from  further  consideration. 

The  bounds  are  obtained  by  replacing  the  problem  over  a  given  subset  with  an  easier 
(relaxed)  problem,  such  that  the  solution  value  of  the  latter  bounds  that  of  the 
former.  The  procedure  ends  when  each  subset  has  either  produced  a  feasible 
solution,  or  was  shown  to  contain  no  better  solution  than  the  one  already  in 
hand.  The  best  solution  found  during  the  procedure  is  a  global  optimum. 

For  any  problem  P,  we  denote  by  v(P)  the  value  of  (an  optimal  solution 
to)  P.  The  essential  ingredients  of  any  branch  and  bound  procedure  for  a  dis¬ 
crete  optimization  problem  P  of  the  form  minr.f(x)|x  €  s]  are 

(i)  a  relaxation  of  P,  i.e.  a  problem  R  of  the  form  min-'g(x)jx  c  T], 
such  that  SiT  and  for  every  x,y€S,  f(x)  <  f(y)  implies  g(x)  <  g(y). 

(ii)  a  branching  or  separation  rule,  i.e.  a  rule  for  breaking  up  the 
feasible  set  S,  of  the  current  subproblem  P^  into  subsets 

q 


’ii’ 


'iq’ 


such  that 


'ij 


a  lower  bounding  procedure,  i.e.  a  procedure  for  finding  (or 
approximating  from  below)  v(EL)  for  the  relaxation  of  each 

subproblem  P  ;  and 

(iv)  a  subproblem  selection  rule,  i.e.  a  rule  for  choosing  the  next 
subproblem  to  be  processed. 

Additional  ingredients,  not  always  present  but  always  useful  when  present, 

are 

(v)  an  upper  bounding  procedure,  i.e.  a  heuristic  for  finding  feasible 
solutions  to  ?;  and 

(vi)  a  testing  procedure,  i.e.,  a  procedure  for  using  the  logical  implications 
of  the  constraints  and  bounds  to  fix  the  values  of  some  variables 
(reduction,  variable  fixing)  or  to  discard  an  entire  subproblem 
(dominance  tests). 

For  more  information  on  enumerative  methods  in  integer  programming  see, 

for  instance,  Chapter  4  of  Garfinkel  and  Nemhauser  [1972],  and/or  the  surveys 
by  Balas  [1975],  Balas  and  Guignard  [1979],  3eale  [1979],  Speilberg  [1979]. 

Since  by  far  the  most  important  ingredient  is  (i),  we  will  classify  the 
branch  and  bound  procedures  for  the  TSP  according  to  the  relaxation  that  they 
use. 

The  integer  programming  formulation  of  the  TSP  that  we  will  refer  to  when 
discussing  the  various  solution  methods  is  defined  on  a  complete  directed  graph 

G  »  (V,A]>  on  n  nodes,  with  node  set  V  =  [l . n},  arc  set  A  =  i(i,j)]i,j  =  1,... 

and  nonnegative  costs  c^  associated  with  the  arcs.  The  fact  that  G  is 


complete  involves  no  restriction,  since  arcs  that  one  wishes  to  ignore  can  be 
assigned  the  cost  c_  =  =.  In  all  cases  c^  =  =  ,  V  ioV.  The  TS?  can  be 
formulated,  following  Dantzig,  Fulkerson  and  Johnson  [1954],  as  the  problem 


min  Z  Z  c. .x.  . 
iSVjeV  1J  LJ 


Z  x. .  »  1  ,  i€V 

j€V  1J 

Z  x. .  =  1  ,  j€V 

1  rz\  J 


z  z  x  <  ! s[ -1  , 

i£Sj£3  ‘J 

»tj  -  0  or  1,  i,j€V. 


VSJ  V,  SM> 


where  x  =  1  if  arc  (i,j)  is  in  the  solution,  x  *  0  otherwise. 

“  -i  J 

The  subcour  elimination  inequalities  (3)  can  also  be  written  as 


Z  Z  x  >  1  , 

i€Sj€Vv  S 


V  S  V,  S Hfl 


A  very  important  special  case  is  the  symmetric  TSP,  in  which  c.^  *  cji’ 
Vi,j  .  The  symnetric  TSP  can  be  defined  on  a  complete  undirected  graph  G  =(V,E) 
on  n  nodes,  with  node  set  V,  edge  set  E,  and  arbitrary  costs  c^..  It  can 
be  stated  as 


min  Z  Z  c,.x,, 
i€VJ>i  lj  ij 


Z  x . .  Z  x  =2  , 

j<i  -1  j>t  1J 


t 
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(8)  Z  Z  x  <  jsj-1,  V  S  c  V,  S*? 

i€SjcS 

j>i 

(i)  =  0  or  1,  i,j€V  ,  j  >i 

where  the  subtour  elimination  inequalities  (8)  can  also  be  written  as 

(10)  Z  Z  x.  .  +  Z  Z  x.  .  >  2  ,  V  S  C  V,  SM 

i€Sj€v\S  1J  i€V\Sj€S  1J 
j>i  j>i 

Next  we  outline  two  versions  of  a  branch  and  bound  procedure  for  the  TSP. 
Prior  to  using  any  of  these  versions,  a  relaxation  R  of  the  TSP  must  be 
chosen.  Both  versions  carry  at  all  times  a  list  of  active  subproblems.  They 
differ  in  that  version  1  solves  a  (relaxed)  subproblem  only  when  node  k  is 

selected  and  taken  off  the  list,  while  version  2  solves  each  (relaxed)  sub¬ 
problem  as  soon  as  it  is  created,  i.e.  before  it  is  placed  on  the  list. 

Although  the  branch  and  bound  procedures  used  in  practice  differ  among  them¬ 
selves  in  many  details,  nevertheless  all  of  them  can  be  viewed  as  variants  of 
one  of  these  two  versions. 

Branch  and  bound  method  for  the  TSP 
Version  1 

1.  (initialization).  Put  TSP  on  the  list  (of  active  subproblems).  Initia¬ 
lize  the  upper  bound  at  U  =  ®,  and  go  to  2. 

2.  (Subproblem  selection).  If  the  list  is  empty,  stop:  the  tour  associated 

with  U  is  optimal  (0r,  if  U  =  ®,  TSP  has  no  solution).  Otherwise  choose 
a  subproblem  TSPi  according  to  the  subproblem  selection  rule,  remove  TSP^ 
from  the  list,  and  go  to  3. 
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3.  (Lower  bounding).  Solve  Che  relaxation  R.  of  TSP.  or  bound  v(R.)  from 

1  1  '1 

below,  and  let  L.  be  the  value  obtained. 
i 

If  L.  >  U,  return  to  2. 

l  - 

If  <  U  and  the  solution  defines  a  tour  for  TSP,  store  ic  in  place 

of  the  previous  best  tour,  set  U  *-  L^ ,  and  go  to  5. 

If  <  U  and  the  solution  does  not  define  a  tour,  go  to  4. 

4.  (Upper  bounding:  optional).  Use  a  heuristic  to  find  a  tour  in  TSP.  If 
a  better  tour  is  found  than  the  current  best,  store  it  in  place  of  the 
latter  and  update  U.  Go  to  5. 

5.  (Reduction:  optional).  Remove  from  the  graph  of  TSP^  all  the  arcs  whose 
inclusion  in  a  tour  would  raise  its  value  above  U,  and  go  to  6. 

6.  (Branching).  Apply  the  branching  rule  to  TS?i,  i.e.  generate  new 

subproblems  TS?^,...,  TSP  ,  place  them  on  the  list,  and  go  to  2. 

Version  2 

1.  (Initialization).  Like  in  version  1,  but  solve  R  before  putting  TSP 
on  the  list. 

2.  (Subproblem  selection).  Same  as  in  version  1. 

3.  (Upper  bounding:  optional).  Same  as  step  4  of  version  1,  with  "go  to  5" 
replaced  by  "go  to  4." 

4.  (Reduction:  optional).  Same  as  step  5  of  version  1,  with  "go  to  6"  replaced 
by  "to  go  3." 

5.  (Branching).  Use  the  branching  rule  to  define  the  set  of  subproblems 
TS?il» • * • ’TS?iq  t0  ^e  generated  from  the  current  subproblem  TSP^, 
and  go  to  6. 


6 


^.Lcwer  bounding).  If  all  the  subproblems  to  be  generated  from  TS? 

according  to  the  branching  rule  have  already  been  generated,  go  to  2. 

Otherwise  generate  the  next  suboroblem  TSP. .  defined  by  the  branching 

ij  3 

rule,  solve  the  relaxation  R_  of  TSP_  or  bound  v(R_)  from  below, 

and  let  L. .  be  the  value  obtained, 
ij 

If  L. .  >  U,  return  to  6. 
ij  — 

If  <  U  and  the  solution  defines  a  tour  for  TSP,  store  it  in  place 

of  the  previous  best  tour,  set  U  —  ,  an<*  8°  Co  &> 

If  L. .  <"  U  and  the  solution  does  not  define  a  tour,  place  TSP. .  on  the  list 
lj  lj 

and  return  to  6. 


In  both  versions,  the  procedure  can  be  represented  by  a  rooted  tree  (search 
or  branch  and  bound  tree)  whose  nodes  correspond  to  the  subproblems  generated, 
with  the  root  node  corresponding  to  the  original  problem,  and  the  successor  nodes 
of  a  given  node  i  associated  with  TS?^  corresponding  to  the  subproblems 
TSP . 1 , . . . ,TSP^  defined  by  the  branching  rule. 

It  is  easy  to  see  that  under  very  mild  assumptions  on  the  branching  rule 
and  the  relaxation  used,  both  versions  of  the  above  procedure  are  finite  (see 
Exercise  1). 

Next  we  discuss  various  specialisations  of  the  procedure  outlined  above, 
classified  according  to  the  relaxation  that  they  use.  When  assessing  and 
comparing  the  various  relaxations,  one  should  !;eep  in  mind  that  a  "good"  re¬ 
laxation  is  cne  that  (i)  gives  a  strong  lower  bound,  i.e.  yields  a  small 
difference  v(TS?)  -  v(R)  ;  and  (ii)  is  easy  to  solve.  Naturally,  these  are 
often  conflicting  goals,  and  in  such  cases  one  has  to  weigh  the  tradeoffs. 


-«  Relaxation  I:  The  Assignment  Problem  with  che  TS?  Cost  F”rc"ic" 

The  most  straightforward  relaxation  of  the  ISP,  and  hist  .ricaliy  the  first 
one  to  have  been  used,  is  the  problem  obtained  from  the  integer  programming 
formulation  (1),  (2),  (3),  (4)  by  removing  the  constraints  (3),  i.e.  the 
assignment  problem  (AP)  with  the  same  cost  function  as  ISP.  <*.  was  used, 
among  others,  by  Eastman  [1958],  Little,  Murty,  Sweeney  a  varel  [1963], 

Shapiro  [1966],  Bellmore  and  Malone  [1971],  Smith,  Srini  sn  and  Thompson 
[1977],  Carpaneto  and  Toth  [1980], 

An  assignment  (i.e.,  a  solution  to  AP)  is  a  union  of  directed  cycles, 
hence  either  a  tour,  or  a  collection  of  subtours.  There  are  n!  distinct 
assignments,  of  which  (n-1) !  are  tours.  Thus  on  the  average  one  in  every  n 
assignments  is  a  tour.  Furthermore,  in  the  current  context  only  those  assign¬ 
ments  are  of  interest  that  contain  no  diagonal  elements  (i.e.,  satisfy  * 

0,  i=l,**»,n),  and  their  number  is  n!/e  rounded  to  the  nearest  integer, 
i.e.  [n!/e  +  1/2]  (see,  for  example,  Hall  [1967],  p.  10).  Thus  on  the  average  one 
in  every  n/e  "diagonal-free"  assignments  is  a  tour.  This  relatively  high  fre¬ 
quency  of  tours  among  assignments  suggests  that  v(AP)  is  likely  to  be  a  pretty 
strong  bound  on  v(TSP),  and  computational  experience  with  AP-based  solution 
methods  supports  such  a  view.  To  test  how  good  this  bound  actually  is  for 
randomly  generated  problems,  we  performed  the  following  experiment.  We  gen¬ 
erated  400  problems  with  50  <  n  <  250  with  the  costs  independently  drawn  from  a 
uniform  distribution  of  the  integers  over  the  intervals  [1,100]  and  [1,1000], 
and  solved  both  A?  and  TSP.  We  found  that  on  the  average  v(A?)  was  99.2%  of 
v!T3?).  Furthermore,  we  found  che  bound  to  improve  with  problem  size,  in  that 
for  the  problems  with  50  <  n  <  150  and  150  <  n  <  250  the  outcomes  were  98.8% 
and  ?9 . 5%,  respectively. 
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A?  can  be  solved  by  che  Hungarian  method  (Kuhn  [1955' ;  for  a  more  recent 

treatment,  see  Christofides  [1975  1  or  Lawler  [19751'/  in  at  most  sters. 

The  assignment  problems  A?;  to  be  solved  at  every  node  of  che  search  tree  differ 

from  the  initial  assignment  problem  A?  in  that  some  arcs  are  excluded  Cforbicaen) 
from,  while  other  arcs  are  included  (forced)  into  the  solution.  These  modifi¬ 
cations  do  not  present  any  difficulty.  Since  the  Hungarian  method  provides  at 
every  iteration  a  lower  bound  onv(Ap),  the  process  of  solving  a  subproblem  can 
be  stopped  whenever  the  lower  bound  meets  the  upper  bound  U.  More  importantly, 
in  the  typical  case  (see  the  branching  rules  below) ,  the  assignment  problem  A? . 
to  be  solved  at  node  j  of  the  search  tree  differs  from  che  problem  AP^  solved 
at  the  parenc  node  i  only  in  that  a  certain  arc  belonging  to  the  optimal  solution 
of  A?,  is  excluded  from  the  solution  of  AP^  ,  and  possibly  some  other  arcs  are 
required  to  maintain  the  same  position  (in  or  ouc)  with  respect  to  the  solution 
of  A?j,  that  they  have  with  respect  to  that  of  AP^.  Whenever  this  is  the  case, 
the  problem  AP  can  be  solved  by  the  Hungarian  method  starting  from  the  optimal 
solution  of  the  problem  at  the  parent  node  (or  at  a  brother  node),  in  at  most  0(n“) 
steps  (see  Exercise  2  or  3ellmore  and  Malone  [  1971]).  For  an  efficient  inplementa- 
tation  of  this  version  of  the  Hungarian  method,  which  uses  on  the  average  consider- 

ably  less  than  0(n~)  steps,  see  Carpaneto  and  Toth  [1980].  The  primal  simplex 
method  for  the  assignment  problem  has  also  been  used  in  a  parametric  version 

to  solve  efficiently  this  sequence  of  interrelated  assignment  problems  by 
Smith,  Srinivasan  and  Thompson  [1977], 

The  lower  bound  v(AP)  can  be  slightly  improved  by  addition  of  a  penalty. 

This  can  be  calculated  as  the  minimal  increase  in  the  objective  function 
caused  either  by  a  first  simplex  pivot  chat  eliminates  some  arc  from  the 
s.luci'r..  or  by  a  first  iteration  of  the  Hungarian  method  chat  accomplishes  the  same 
to.io.a.  : ur tr ermore ,  the  arc  to  be  included  in  the  solution  bv  the  oivoc  car.  be 
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-3a  cricked  to  a  cutset  defined  by  some  subcour  of  the  A?  solution.  Computa¬ 
tional  experience  indicates,  however,  that  the  impact  of  such  a  penalty  tends 
to  decrease  with  problem  size  and  is  negligible  for  anything  but  small  problems. 
In  the  computational  experiment  involving  the  400  randomly  generated  problems 
that  we  ran,  the  addition  of  a  penalty  to  vfAP)  raised  the  value  of  the  lower 
bound  on  the  average  by  0.03%,  from  99.2%  to  99.23%  of  v(TSP). 

Branching  rules 

Several  branching  rules  have  been  used  in  conjunction  with  the  AP  relaxa- 
ation  of  the  TSP.  In  assessing  the  advantages  and  disadvantages  of  these  rules 
one  should  keep  in  mind  that  the  ultimate  goal  is  to  solve  the  TSP  by  solving 

as  few  subprcblems  as  possible.  Thus  a  "good"  branching  rule  is  one  that 
;a)  generates  few  successors  of  a  node  of  the  search  tree,  and  (b)  generates 

strongly  constrained  subproblems,  i.e.  excludes  many  solutions  from  each 
subproblem.  Again,  these  criteria  are  usually  conflicting  and  the  merits  of 
the  various  rules  depend  on  the  tradeoffs. 

We  will  discuss  the  various  branching  rules  in  terms  of  sets  of  arcs 
excluded  (E^)  from,  and  included  (I,  )  into  the  solution  of  subproblem 
k.  In  terms  of  the  variables  x^  ,  the  interpretation  of  these  sets  is  that 
subproblem  k  is  defined  by  the  conditions 

f  0 ,  ( i  ,  j  )  c  E. 

.11)  x. .  =  <  K 

|^1,  (iJ)  €  Ik 

in  addition  to  (1),  (2),  (3),  (4).  Thus  the  relaxation  of  subproblem  k  is 
:iven  by  ('ll)  in  addition  to  (1),  (2),  (4).  We  abbreviate  Branching  Rule 


m+mmtm 
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BR  1.  (Little,  Murty,  Sweeney  and  Karel  [1963]),  Given  the  current  re¬ 
laxed  subproblem  AP,  and  its  reduced  costs  c.  .  =c..  -u.  -  v.,  where  u. 

k  lj  lj  i  j  l 

and  v  are  optimal  dual  variables,  for  every  arc  (i,j)  such  that  cV^  =  0 
define  the  penalty 

p.  .  =  min  *i c*. ,  :  hcV\,r  jlT  +  min  -(c"  .  :  h€V'  {ill 

i j  v  lh  * J  ■*  j  c  hj  c  , 

and  choose  (r,s)  5  A  such  that 

r  _  i 

p  =  max  <  p .  .  :  c..  =  0  (  . 

“rs  _rij  ij  i 

Then  generate  two  successors  of  node  k,  nodes  k  +  1  and  k  +  2,  by 
defining 


Ek+1  “  Ek  ^(r,S)-  ’ 


k+1 


\ 


and 


Ek+2  ~  Ek  ’  \+2  ~  \  !JC(r,s)}  . 

This  rule  does  not  use  the  special  structure  of  the  TSP  (indeed,  it  applies 

to  any  integer  program),  and  has  the  disadvantage  that  it  leaves  the 

optimal  solution  to  AP,  feasible  for  AP, 

k  k+2 

The  following  rules  are  baseu  on  disjunctions  derived  from  the  subtour 
elimination  inequalities  (3)  or  (5), 

BR  2,  (Eastman  [1958],  Shapiro  [1966]),  Let  x  be  the  optimal  solution  to 

the  current  relaxed  subproblem  AP  ,  and  let  A  =  f (i  ,i  ) . (i  ,i  ) }  be  the 

tC  o  L  Z  Cl 

arc  set  of  a  minimum  cardinality  subtour  of  x  involving  the  node  set 
S  =  fij,,,,,it}.  Constraint  (3)  for  S  implies  the  inequality 


( 3  ) 


(i,j)?As  J 


£  IS  i  -  1, 


which  in  turn  implies  the  disjunction 


(12)  xi  i  "  0  V  ...  v  x  =  0  . 

i  2  Vl 


11 


Gene race 


successors  of  node  k,  defined  by 


:k  (ir,ir-l'1  • 


(131 


/*  r  =  1 , .  i  •  ,  t 


I,  =  I, 
k+r  k 


with  it+l  =  il  * 

k 

Now  x  is  clearly  infeasible  for  all  A.P,^+r>  r  =  l,,..,t,  and  Che  choice 
of  a  shortest  subtour  for  branching  keeps  the  number  of  successor  nodes  small. 
However,  the  disjunction  (12)  does  not  define  a  partition  of  the  feasible  set 
of  AP,  ,  and  thus  different  successors  of  AP,  may  have  solutions  in  common.  This 

rC  rC 

shortcoming  is  remedied  by  the  next  rule,  which  differs  from  SR  2  only  in  that 
it  strengthens  the  disjunction  (12)  fo  one  that  defines  a  partition. 

BR  3.  (Murty  [1968],  Bellmore-Malone  [1971],  Smith,  Srinivasan  and 
Thompson  [1977]).  The  disjunction  (12)  can  be  strengthened  to 

(U)  (x.  .  =  0)  V(x  =  1,  x,  =  0)  V...V(x  =...=  x.  .  -  1,  x.  .  =  0 

‘1*2  11L2  "2l3  V2  lt-l‘t  Vl 


and  accordingly  (13)  can  be  replaced  by 
^k+r  “  \  (ir,ir+l)' 


(13) 


IT  ^  1 1  •  •  •  j  C 


Zk+r  *  \  -'•'(i1«i2)»***’(ir-l,1r)- 


/ 


with  i  .  =  i. 

C-rl  1 


slightly  different  version  of  3R  3  (as  well  as  of  3R  21  is  to  replace 

-he  acre  set  A.  of  a  minimum-cardinalicv  subtour  with  that  oi  a  subtour  vrth  o 
o 
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~ir.ir.urr.  number  of  free  edges  (i.e.  edaes  of  E.  „I,  ),  This  rule  is  used  ir. 

~  rC  is. 

Carpar.eco  and  Toth  [1980], 

1c  .. 

3R  4,  (3ellmore  and  Malone  [1971]).  Let  x  and  S  be  as  before. 
Constraint  (5)  implies  the  disjunction 


(16)  (x.  .  =  0,  j€S)  V(x.  =  0,  j€S)  V...V(x  =  0,  j€S). 

xl-  12J  tJ 


Generate  t  successors  of  node  k,  defined  by 


k+r 


=  E.  ~{(i  ,j)  :  j€S>; 


(17) 


r  =  1,... ,t 


~k-r  k 

Like  in  the  case  of  3R  2,  Br  4  makes  x  infeasible  for  all  successor 
problems  of  AP.^,  but  again  (16)  does  not  partition  the  feasible  sec  of  AP^. 
This  is  remedied  by  the  next  rule,  which  differs  from  BR  4  only  in  that  it 
defines  a  partition. 

3R  5.  (Garfinkel  [1973]).  The  disjunction  (16)  can  be  strengthened  to 


(18) 


x.  .  =0,  j€S)  V(x  .  =0,  j€V\S;  x  .  =0,  j€S)  V... 
11J  X1J  X2J 


V(::i  j  -  0,  j€V\S  ,r  =  l,...,t-l;  ^  ^  =  0,  j€S) 
r  £ 


and  accordingly  (17)  can  be  replaced  by 


I,.  _  -  E.  J;(i_,j):  J€S}  L  (  (i  ,  j ) :  q«l,...,r-l,  jSV'.S] 

ic-rr  t  M 


,191 


— 1 
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The  two  rules  BR  2  and  BR  4  (or  their  strengthened  variants,  BR  3  and 
BR  5),  based  on  the  subtour-elimination  constraints  (3/)  and  (5),  respectively, 
generate  the  same  number  of  successors  of  a  given  node  k.  However,  the  rule 
based  on  inequality  (5)  generates  more  tightly  constrained  subproblems,  i.e., 
excludes  a  greater  number  of  assignments  from  the  feasible  set  of  each  successor 
problem,  than  the  rule  based  on  inequality  (3').  Indeed,  with  js[  =  k,  we  have 

Theorem  1.  (Bellmore  and  Malone  [1971]).  Each  inequality  (3;)  eliminates 
l(n-k)!/e  4-  1/2 ]  diagonal-free  assignments,  whereas  each  inequality  (5)  eliminates 
l(n-k)!/e  +  1/2]  •  lk!/e  +  1/ 2 ]  diagonal-free  assignments. 

Proof.  Each  inequality  (3 /)  eliminates  those  diagonal-free  assignments 
that  contain  the  subtour  with  arc  set  Ag .  There  are  as  many  such  assignments 
as  there  are  diagonal-free  assignments  in  the  complete  graph  defined  on  node 
set  V.S,  and  the  number  of  these  is  (n-k)!/e  rounded  to  the  nearest  integer,  i.e,, 
l(n-k)!/e  +  1/ 2 |  (see  section  2). 

On  the  other  hand,  each  inequality  (5)  eliminates  those  diagonal-free 
assignments  consisting  of  the  union  of  two  such  assignments,  one  in  the  complete 
graph  defined  on  S,  the  other  in  the  complete  graph  defined  on  V\S.  Since  the 
number  of  the  latter  is  l(n-k)!/e  +  1/2 )  and  that  of  the  former  is  [k!e  +  1/2], 
the  number  of  diagonal-free  assignments  eliminated  by  each  inequality  (5)  is  as 
stated  in  the  theorem./ 

Nevertheless,  both  Smith,  Srinivasan  and  Thompson  [1977]  and  Carpaneto  and 
Toth  [1980]  found  their  respective  implementations  of  BR  3  more  efficient  than 
BR  4  or  BR  5,  both  in  terms  of  total  computing  time  and  number  of  nodes  generated. 
We  have  no  good  explanation  for  this. 


A 


.?  the 


i 
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r  features 

The  subproblem  selection  rule  used  by  many  branch  and  bound  algorithms  is  the 
one  known  as  "depth  first"  or  LIFO  (last  in  first  out).  It  amounts  to  choosing 
one  of  the  nodes  generated  at  the  last  branching  step  (in  order,  for  instance, 
of  nondecreasing  penalties,  like  in  Smith,  Srinivasan  and  Thompson 

[1977]);  and  when  no  more  such  nodes  exist,  backtracking  to  the  parent  node 
and  applying  the  same  rule  to  its  brother  nodes.  This  rule  has  the  advantage 

of  modest  storage  requirements  and  easy  bookkeeping.  Its  disadvantage  is  that 
possible  erroneous  decisions  (with  respect  to  arc  exclusion  or  inclusion'  made 
early  in  the  procedure  cannot  be  corrected  until  late  in  the  procedure. 

The  alternative  extreme  is  known  as  the  "breadth  first"  rule,  which 
amounts  to  always  choosing  the  node  with  the  best  lower  bound.  This  rule  has 
the  desirable  feature  of  keeping  the  size  of  the  search  tree  as  small  as  possible, 
(see  Exercise  3h ,  but  on  the  other  hand  requires  considerable  storage  space.  In 
order  to  keep  simple  the  passage  from  one  subproblem  to  the  next  one,  this  rule 

must  be  embedded  in  a  procedure  patterned  after  version  2  of  the  outline  in  the 

introduction,  which  solves  each  assignment  problem  as  soon  as  the  corresponding  node 
is  generated,  and  places  on  the  list  only  those  subproblems  TSP^  with  <  U. 

The  procedure  of  Carpaneto  and  Toth  [1980]  uses  this  rule,  and  it  chooses  the 
subproblems  to  be  processed  (successors  of  a  given  node)  in  the  order  defined 
by  the  arc  adjacencies  in  the  subtour  that  serves  as  a  basis  for  the  branching. 

As  mentioned  earlier,  the  high  frequency  of  tours  among  assignments  makes 
A?  a  relatively  strong  relaxation  of  TSP,  which  in  the  case  of  random  (asymmetric) 
costs  provides  an  excellent  lower  bound  on  v(TSP).  However,  in  the  case  of 
the  symmetric  TSP,  the  bound  given  by  the  optimal  A?  solution  is  substantially 

weaker.  An  experiment  that  we  ran  on  140  problems  with  40  <  n  <  100  and  with 

symmetric  costs  independently  drawn  from  a  uniform  distribution  of  the  integers 
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in  the  interval  [1,  1000],  showed  v(AP)  to  be  on  the  average  827=  of  v(TSP), 

while  the  addition  of  a  penalty  raised  the  bound  to  857,.  The  explanation  of  the 

relative  weakness  of  this  bound  is  pretty  straightforward:  in  the  symmetric  case, 

there  is  a  tendency  towards  a  certain  symmetry  also  in  the  solution,  to  the  effect 

that  if  x. .  =  1,  then  (since  c..  =  c..),  one  tends  to  have  x..  =  1  too: 
ij  iJ  ji  ’  Ji 

and  thus  the  optimal  AP  solution  usually  contains  a  lot  of  subtours  of  length  2, 
irrespective  of  the  si2e  of  n  .  Thus  as  a  rule,  a  much  larger  number  of 
subtours  has  to  be  eliminated  before  finding  an  optimal  tour  in  the  symmetric 
case  than  in  the  asymmetric  one.  This  makes  the  AP  a  poor  relaxation  for 
the  symmetric  TSP. 

j.  Relaxation  II:  The  1-Tree  Problem  with  Lagrangean  Objective  Function 

This  relaxation  was  successfully  used  for  the  symmetric  TSP  first  by  Held 
and  Karp  [1970,  1971]  and  Christofides  [1970],  and  subsequently  by  Helbig  Hansen 
and  Krarup  [1974],  Smith  and  Thompson  [1977],  Volgenant  and  Jonker  [1982]. 

Consider  the  symmetric  TSP  and  the  undirected  (complete)  graph  G  =  (V,E) 
associated  with  it.  The  problem  of  finding  a  connected  spanning  subgraph  H 
of  G  with  n  edges,  that  minimizes  the  cost  function  (6),  is  obviously  a 
relaxation  of  the  symmetric  TSP.  Such  a  subgraph  H  consists  of  a  spanning 
tree  of  G,  plus  an  extra  edge.  We  may  further  restrict  H  to  the  class  7 
of  subgraphs  of  the  above  type  in  which  some  arbitrary  node  of  G,  say  node  l, 
has  degree  2  and  is  contained  in  the  unique  cycle  of  H.  For  lack  of  a 
better  term,  the  subgraphs  of  this  class  7  are  called  1-trees.  To  see  that 
finding  a  1-tree  that  minimizes  (6)  is  a  relaxation  of  the  TSP,  it  suffices  to 
realize  that  the  constraint  set  defining  the  family  7  is  (9)  and 
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(20)  I  Z  x. .  +  Z  Z  x.  .  >  1,  ¥  S  c  V,  S*3 

l€Sj€V'S  1J  iuV'SjcS  1J 

i>i  j>i 

(21)  I  Z  x. .  =  n 
iSVj>i  1J 

(22)  Z  x  =2 
j€V  1:1 

Here  (20)  is  a  weakening  of  (10),  (21)  is  Che  sum  of  all  equations  (7) 
divided  by  cwo,  and  (22)  is  Che  first  equation  (7), 

The  minimum-cost  1-tree  problem  is  easily  seen  to  be  decomposable  into 
two  independent  problems : 

(a)  to  find  a  minimum-cost  spanning  tree  in  G  -  { 1 ; ;  and 

(.9)  to  find  two  smallest-cost  edges  among  those  incident  in  G  with  node  1. 

The  n-2  edges  of  the  spanning  tree  found  under  (a),  together  with  the 

2  edges  found  under  (9),  form  a  minimum-cost  1-tree  in  G. 

Solving  problem  (9)  requires  0(n)  comparisons,  whereas  problem  (a) 

can  be  efficiently  solved  by  the  algorithms  of  Dijkstra  [1959]  or  prim  [1957], 

2 

of  complexity  0(n  ),  or  by  the  algorithm  of  Kruskal  [1956],  of  complexity 
0 ( J E ]  log  | E j ) .  Since  the  log  | E J  in  the  last  expression  comes  from  sorting 
the  edges,  a  sequence  of  subproblems  that  requires  only  minor  resorting  of  the 
edges  between  two  members  of  the  sequence  can  be  more  efficiently  solved  by 
Kruskal 's  procedure  than  by  the  other  two. 

The  number  of  1-trees  in  the  complete  undirected  graph  G  on  n  nodes  can  be 

calculated  as  follows:  the  number  of  distinct  spanning  trees  in  G  -  [1}  is 

n-3  n-1 

(n-l>  (Cailev's  formula),  and  from  each  spanning  tree  one  can  get  (  2  distinct 

1- trees  by  inserting  two  edges  joining  node  1  to  the  tree.  Thus  the  number  of 
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L-Crees  in  G  is  jp(n-2)  (n-l)n  “,  which  is  much  higher  than  the  number  of  solu¬ 
tions  to  AP.  Since  G  has  fn-1''!  tours,  on  the  average  the  number  of  tours  among 
the  1-trees  of  a  complete  undirected  graph  is  one  in  every  ~(n-2) (n-l)n  ^/(n-2)!, 
and  hence  the  minimum-cost  1-tree  problem  with  the  same  objective  function  as 
the  TSP  is  a  rather  weak  relaxation  of  the  TSP.  In  the  above  mentioned  computa¬ 
tional  experiment  on  140  randomly  generated  symmetric  problems,  we  also  solved 
the  corresponding  1-tree  problems  and  found  the  value  of  an  optimal  1-tree  to  be 
on  the  average  only  637,  of  v(TSP).  However,  this  relaxation  can  be  considerably 
strengthened  by  taking  the  equations  (7)  into  the  objective  function  in  a  Lagrangean 
fashion,  and  then  maximizing  the  Lagrangean  as  a  function  of  the  multipliers. 

The  problem 


(23) 


L(\)  *  ain  2  Z  c  x  +  I  }  (  2  x  +  Z  x  -  2) 
x€X(7)i£Vj>i  J  J  i€V  j<i  "  J>i 


min  Z  2  (c  .  +  A.  +  A.)x  .  -  2  2  A.  , 

jc€X(T)i€Vj>i  iJ  1  J  2 


where  A  is  any  n-vector  and  X(J”)  is  the  set  of  incidence  vectors  of  1-trees  in 
G,  i.e.,  the  set  defined  by  (9),  (20),  (21'),  (22),  is  a  Lagrangean  relaxation 
of  the  TSP.  From  the  last  expression  in  (23)  and  the  fact  that  X(«H  contains 
all  tours,  it  is  easy  to  see  that  for  any  \,  L(A)  <  v(TSP).  (For  surveys  of 
Lagrangean  relaxation  in  a  more  general  context  see  Geoffrion  [1974],  Fisher 
[1981],  Shapiro  (1979].)  The  strongest  Lagrangean  relaxation  is  obviously  given 
by  \  *  A  such  that 

'.24)  LtA)  3  max  L(A)  . 

* 

'■  i 

Problem  i24)  is  sometimes  called  a  Lagrangean  dual  of  the  ISP, 
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Now  (24)  is  a  much  stronger  relaxation  than  the  1-tree  problem 
with  the  TSP  cost  function.  Indeed,  computational  experience  with  randomly 
generated  problems  has  produced  on  the  average  values  of  L (X)  of  about  99% 
of  v(TSP)  according  to  Christofides  [1979]  (p.  134),  and  of  about  99.7%, 
of  v(TSP)  according  to  Volgenant  and  Jonker  [1982]. 

However,  solving  (24)  is  a  lot  more  difficult  than  solving  a  1-tree 
problem.  The  objective  function  of  (24),  i.e.  the  function  L(>.)  of  (23),  is 
piecewise  linear  and  concave  in  X.  Thus  L(X)  is  not  everywhere  differentiable. 
Held  and  Karp  [1971],  who  first  used  (24)  as  a  relaxation  of  the  TSP,  have  tried 
several  methods,  and  found  that  an  iterative  procedure  akin  to  the  relaxation  method 
of  Agmon  [1954]  and  Motzkin  and  Schoenberg  [1954]  was  the  best  suited  approach 
for  this  type  of  problem.  The  method,  which  turned  out  to  have  been  theoret¬ 
ically  studied  in  the  Soviet  literature  (see  Polyak  [1967]  and  others) 
became  the  object  of  extensive  investigations  in  the  Western  literature  under 
the  name  of  subgradient  optimization,  as  a  result  of  its  successful  use 
by  Held  and  Karp  in  conjunction  with  the  TSP  (for  surveys  of  subgradient  opti¬ 
mization  in  a  more  general  context  see  Held,  Wolfe  and  Crowder  [1974], 

Sandi  [1979]). 

The  subgradient  optimization  method  for  solving  (24)  starts  with  some 

o  lc 

arbitrary  X  =  X  (say  the  zero  vector)  and  at  iteration  k  updates  X  as 

follows.  Find  L(Xk),  i.e.  solve  problem  (23)  for  X  =  Xk.  Let  H(Xk)  be 

Ir  Ir 

the  optimal  1-tree  found.  If  H(X  )  is  a  tour,  or  if  v(H(X  ))  >  U, 

k 

stop.  Otherwise,  for  i£v,  let  d^  be  the  degree  of  node  i  in  H(X  )  . 

k  k 

Th-?n  the  n-vector  with  components  d^  -  2,  icV,  is  a  subgradient  of  L(X)  at  X 

(see  Exercise  d'l .  Set 


where  C  is  the  "seep  length"  defined  by 

(25)  tk  =  a(U  -  L(>k))/  2  (dk  -  2)2 

i£V  1 

with  0  <  a  £  2.  Then  set  k  —  k+1  and  repeat  the  procedure. 

It  can  be  shown  (see  any  of  the  surveys  mentioned  above)  that  the  method 

CD 

k  k 

converges  if  Z  t  =  »  and  lim  t  =0.  These  conditions  are  satisfied  if 
k=l  k  -  » 

one  starts  with  a  =  2  and  periodically  reduces  a  by  some  factor. 

Example  1. 

Consider  the  3-city  symmetric  TS?  whose  graph  is  shown  in  Fig.  1  (only  arcs 
with  finite  cost  are  present).  Initially  U  =  25,  a  =  2,  =  0  for  i  =  1,...,8. 


Fiz.  1.  Initial  graph  G  =  (7,E) 


The  optimal  l-tree,  shown  in  heavy  lines  in  Fig.  1,  has  a  weight  of  L(\  )  =  21. 
At  iteration  0  we  have: 


d°  =  (2,  2,  4,  1, 

t°  =  2(25— 21>/8  = 

=  (0,  0,  2,  -1, 

The  updated  arc  costs  (c. . 

ij 

having  a  weight  of  L(\S  = 


1,  3,  2,  1); 

l; 

-1,  1,  0,  -1). 


+  a.|  +  \j)  and  the  corresponding  optimal  1-tree 
24,  are  shown  in  Fig.  2. 


Fig.  2.  Updated  graph  G  =  (V,E) 


We  nave  d  =  2  for  i  =  1,...,8;  thus  a  tour  has  been  found  and  the  procedure  stODS 
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Held  and  Karp  [19’1]  pointed  out  that  if  is  taken  to  be,  instead  of 
0,  the  vector  defined  by 


=  -(u.  +  v.  V 2  , 

l  l  i 


i-V  , 


where  (u,  v)  is  an  optimal  solution  to  the  dual  of  the  assignment  problem  with 

costs  c..  =  c...  ¥•  i.i,  then  one  alvavs  has  vfH(l°) !  v(AP).  Indeed,  for  this 
ij  ji’ 

choice  of  \°  one  has  from  (23! 


L(>,°)  =  min 


(c . 


xcXCT)  i“V  j  -i 


.  o  .  .  o.  .  o 

, .  )x.  .  -  2  L. 

icV 


=  min 


I  — [  (c  -  u.  -  v.!  (c  .  ,  -  u.  -  v  ) ' 


X  -X(.T)  i:V  j  i 


I  (u.  +  v  ) 
icV  1  1 


_  v(AP)  , 


since  v(AP)  =  I  (u.  +  v  )  and  c,  -  u  -  v.  >  0,  ¥  i.i. 

i£V  1  1  ij  i  j  ~  ’  ,J 

This  kind  of  initialization  requires  of  course  that  one  solve  AP  prior  to 
addressing  problem  (24). 

Helbig  Hansen  and  Krarup  [1974]  and  Smith  and  Thompson  [1977]  distinguish  be 
tween  the  application  of  the  subgradient  procedure  at  the  root  node  of  the 
search  tree  and  at  subsequent  nodes,  by  using  different  starting  vectors  \° 
and  different  stopping  rules. 

k 

Volgenant  and  Jonker  [1982]  use  an  updating  formula  for  X  ,  and  an  ex- 
k 

pression  for  t  ,  different  from  (25)  and  (26),  respectively.  Namely,  they 
take  t  to  be  a  positive  scalar  decreasing  according  to  a  series  with  a 
constant  second  order  difference,  i.e. 


k+1 


-  2t  +  t 


k-1 


(27) 


=  constant. 


a r.d  define  *  by  setting  for  i=V, 

j  j-i  *-*  <  ■ ; 

(2S)  .  k-1  = 

+  0.6c"  (dl“-2)  ■+•  0.4t'lC(d‘<  otherwise 

^  1  j.  1 

It  should  be  mentioned  chat  none  of  the  versions  of  this  subgradient  optimization 
method  can  be  guaranteed  to  solve  (24)  in  polynomial  time  with  a  prespecified 
degree  of  accuracy.  However,  the  stopping  rules  are  such  that  after  a  certain 
number  of  iterations  the  procedure  terminates  with  an  approximation  to  an 
optimal  which  gives  a  (usually  good")  lower  bound  on  L('\ ) . 

Branchint  rules 

3".  -5,  (Held  and  Karp  [1971]).  At  node  k,  let  che  free  edges  of  the  current 
1-tree  (i.e.  chose  in  E^.E^JI.^)  be  ordered  according  to  nonincreasing  penalties, 

and  let  the  first  q  elements  of  this  ordered  set  be  j  =  ; (i^ , j^) , . . . , (i^ , j^) ; , 
where  q  will  be  specified  below.  Define  q  new  subprcblems  by 

Ltc+r  =  \  J'-(VV  :  h  =  r  *  1»***’<? 

(29)  =  \  ,  r  =  1,...,Q-1 

“k-bq  =  \  '-{(i.j)^  :  l  -p  or  j  -p} 

Here  p  c  V  is  such  chat  contains  at  most  one  edge  incident  with 

o,  while  L  contains  two  such  edges;  and  a  is  the  smallest  subscrint 

rC-KI 

:f  on  edge  in  J  for  which  a  node  with  the  properties  of  p  exists. 

This  rule  partitions  the  feasible  set,  and  makes  the  current  1-tree 
infeasible  £:r  each  of  the  new  subproblems  generated, 
t.-.e  r.ev  suoprcblems  is  :ftan  larger  than  necessary . 


but  the  number 
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2R  7 .  ;Smith  and  Thompson  [1977]).  Choose  a  node  •'hose  decree  in  thc- 
current  1-tree  is  not  2,  and  a  maximum-cost  edge  (i,j)  among  those  incident 
with  the  chosen  node.  Then  generate  two  new  subproblems  defined  by 


U 

"k+1 


(30) 


^+2 


~k+l  =  Lk 

^k+2  =  ^k 


This  rule  generates  only  two  successors  of  each  node  k  of  the  search 
tree,  but  the  minimum  1-tree  in  subproblem  k  remains  feasible  for  subproblem 


SR  3 .  (Volgenant  and  Jonker  [1982]),  Choose  a  node  p  whose  degree  in 
the  current  l-tree  exceeds  2.  Such  a  node  is  incident  with  at  least  two  free 
edges,  say  and  (i7»j9)  (otherwise  1^  contains  two  edges  incident 

with  p,  hence  the  remaining  edges  incident  with  p  belong  to  or  should  belong 
to  Z.  ) .  Generate  three  new  suboroblems  defined  by 


v 

“k+1 


^ j.]_  ^  >  J  j_)  i  (i- ■?  > 3 2^  ’ 


(31) 


k+2 


h 


(ii 


.Jx>? 


Ek+3  =Ek^(i: 


» J  x )  ] 


*k+3  "  *k 


If  p  is  incident  with  an  edge  in  ,  then  node  k+1  is  not  generated. 
This  rule  also  partitions  the  feasible  set  and  makes  the  1-tree  at  node  k 
infeasible  for  each  of  the  successor  nodes,  while  the  number  of  successors  of 
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Other  features 

Held  and  Karp  [1971]  and  Smith  and  Thompson  [1977]  use  a  depth  first  sub¬ 
problem  selection  rule,  while  Volgenant  and  Jonker  [1982]  have  implemented  both 
a  depth  first  and  a  breadth  first  rule,  with  computational  results  that  indi¬ 
cate  a  slight  advantage  for  the  depth  first  rule  (in  their  implementation). 

Extension  to  the  asymmetric  TSP 

The  basic  ideas  of  the  1-tree  relaxation  of  the  symmetric  TSP  caTy 
over  to  the  asymmetric  case  (Held  and  Karp  [1970]),  in  that  the  1-tree  in 
an  undirected  graph  can  be  replaced  by  a  1-arborescence  in  the  directed  graph 
G  =  (V,A),  defined  as  an  arborescence  (directed  tree)  rooted  at  node  1, 
plus  an  arc  (i,l)  joining  some  node  i€V\{l}  to  node  1.  The  constraints 
defining  a  1-arborescence,  namely  (4)  and 


(32)  Z  Z  x  >  1  ,  V  S  q  V  :  {l}€S 

icSjSV\S  J  F 

(33)  Z  Z  x. .  =  n 
i€Vj€V  1J 


(34)  Z  x..  =1 

i€V  11 

are  easily  seen  to  be  a  relaxation  of  the  constraint  set  (2),  (4),  (5)  of 
the  TSP. 

The  problem  of  finding  a  minimum-cost  1-arborescence  can  again  be  de¬ 
composed  into  two  independent  problems,  namely  (a)  finding  a  minimum-cost 
arborescence  in  G  rooted  at  node  1,  and  (g)  finding  a  minimum-cost  arc 

(i,l)  in  G.  Problem  (a)  can  be  solved  by  the  polynomial  time  algorithms 

2 

of  Edmonds  [1967]  or  Fulkerson  [1974],  or  by  the  0 (n  )-time  algorithm  of 
Tarjan  [1977]. 


To  obtain  the  Lagrangaan  version  of  the  1-arborescence  relaxation,  one 
forms  the  function 


(35)  L(\)  -  nia  II  c  .x  +  Z  \  (  Z  x  -  1) 

x£X(C)  iSVjcV  J  J  icV  j£V 


min  Z  Z(c..+X.)x..-  Z  X .  , 
x£X(G)  i£Vj£V  1J  1  1J  iC/  1 


where  X(G)  is  the  set  of  incidence  vectors  of  C»  the  family  of  1-arbo- 
rescences  in  G.  Again,  the  strongest  lower  bound  on  v(TSP)  is  of  course 
given  by  \  =  \  such  chat 


(36)  L(\>  =  max  L(X)  , 

X 

and  subgradient  optimization  can  be  used  to  solve  problem  (36).  However, 
computational  experience  with  this  relaxation  (see  Smith  [1975])  shows  it  to 
be  inferior  (for  asymmetric  problems)  to  the  AP  relaxation,  even  when  the 
latter  uses  the  original  objective  function  of  the  TSP. 

4.  Relaxation  III:  the  Assignment  Problem  with  Lagrangean  Objective  Function 
This  relaxation  was  used  for  the  asymmetric  TSP  by  Balas  and  Christofides 
[1981],  It  is  a  considerable  strengthening  of  the  relaxation  consisting 
of  Che  A?  with  the  original  cost  function,  involving  a  significant  computa¬ 
tional  effort,  which  however  seems  amply  justified  by  the  computational 
results  that  show  this  approach  to  be  the  fastest  currently  available  method 
for  this  class  of  problems. 

Consider  the  asymmetric  TSP  defined  on  the  complete  directed  graph  G  =  i'V.A'i, 
in  the  integer  programming  formulation  (1),  (2),  (4),  plus  the  subtour-elixinati on 
tons  traints .  The  latter  can  be  written  either  as  (3^  or  as  (o'*,  but  for  reasons 


i 


•  • 
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so  be  explained  lacer,  we  include  both  <' 3 b  and  f5),  as  well  as  some  oositive 
linear  combinations  of  such  inequalities,  and  write  the  resulting  set  of  subtour- 
elinination  inecualities  in  the  generic  fora 


„  „  t 


_  c.  a.  .x.  .  >  a  , 
i£Vj€V  "  0 


t  €  T 


Thus  our  integer  programing  formulation  of  the  TSP  consists  of  (lb,  (2), 
and  (37).  To  construct  a  Lagrangean  relaxation  of  TSP,  we  denote  by  X  the  feasible 
set  of  AP,  and  associate  a  multiplier  w  ,  tcT,  with  every  inequality  in  the  system 
(37).  We  then  have 


,(w)  =  min  3  I  c. .x  -  3  w  (  I  Z  aC ,x. .  -  aC 

•li-f.-jfV  1J  1J  t€T  i€Vj€V  1J  1J 

=  min  Z  -  (c. .  -  Z  w  a. . )x. .  +  Z  w  a 
xCiizvje;  1J  t~T  C  1J  1J  t£T  c  ° 

Clearly,  the  strongest  such  relaxation  is  given  by  w  =  w  such 


L (w)  =  max  L(w) 
w>Q 


The  Lagrangean  dual  (39)  of  the  TSP  could  be  solved  by  subgradient  optimi¬ 
zation,  like  in  the  case  of  the  1-tree  relaxation  of  the  symmetric  TSP.  However, 
in  this  case  the  vector  w  of  multipliers  has  an  exponential  number  of  compo¬ 
nents,  and  until  an  efficient  way  is  found  to  identify  the  components  that  need 
to  be  changed  at  every  iteration,  such  a  procedure  seems  computa tionally 
expensive.  3aias  and  Christofides  [1981]  therefore  replace  (39)  by  the 
’restricted"  Lagrangean  dual 


max  L  i'w )  , 


such  chat 


-,n 


w  >  0  and  there  exists  u,vi?. 


<;  v 


u .  -  v  .  - 
1  1 


v  a 


c .  .  i  r  x .  . 
ij  ij 


ij 


<  c .  . 
“  ij 


Li  X. 


and  x  is  the  optimal  solution  found  for  the  A?. 


In  other  words,  (40)  restricts  the  multipliers  w^  to  values  that. 


together  with  appropriate  values  »  Vj  >  fora  a  feasible  solution  to  the 


dual  of  the  linear  program  given  by  (1),  (2),  (37)  and  x  >  0  ,  i,j£V. 

This  may  cause  the  value  of  (d0)  to  be  less  than  that  of  <  3 9 > ,  but  it 

leaves  the  optimal  solution  x  to  AP,  also  optimal  for  the  objective  function 

(38).  Thus  (-0)  can  be  solved  without  changing  x.  While  no  good 

method  is  known  for  the  exact  solution  of  (40),  Balas  and  Christofides  [1931] 

give  a  polynonially  bounded  sequential  noniterative  approximation  procedure, 

A  A 

which  yields  multipliers  wc  such  that  L(w)  typically  comes  close  to  v(TSP): 
for  randomly  generated  asymmetric  TSP's,  L(w)  was  found  to  be  on  the  average 
99.5%  of  v(TSP)  (Christofides  [1979],  p.  139-140). 

The  procedure  starts  by  solving  AP  for  the  costs  c.  ,  u  i,j,  and  taking  u^, 
to  be  the  components  of  the  optimal  solution  to  the  dual  of  AP.  It  then  assigns 


values  to  the  multipliers  sequentially,  without  changing  the  values  assigned 


earlier.  We  say  that  an  inequality  (37)  admits  a  positive  multiplier,  if  there 
exists  a  v_  >■  0  which,  together  with  the  multipliers  already  chosen,  satisfies 
the  constraints  of  V.  At  any  stage,  v(TSP)  is  bounded  from  below  by 


>i’  I  u.  +  i  v.  +  I  w  a  , 

i€V  1  jtV  J  t€T  C 


since.  u,  v,  w>  is  a  feasible  solution  to  the  dual  of  the  linear  program  defined 
bv  1  ,  C>,  (2T  ••  and  xt,  ^0,  “  i,j. 
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The  bounding  procedure  successively  identifies  valid  inequalities  that 
vi)  are  violated  by  the  A?  solution  x  ,  and 
(ii)  admit  a  positive  multiplier. 

Such  inequalities  are  included  into  L(w)  in  the  order  in  which  they 


are  found,  with  the  largest  possible  multiplier  w  .  The  inclusion  of  each 
new  inequality  strengthens  the  lower  bound  L(w).  We  denote  by  1:  _  the  re¬ 
duced  costs  defined  by  the  optimal  dual  variables  u^  ,  v.  and  the  multipliers 

w  ,  i . e . ,  c .  .  =  c..  -  u.  -  v.  -  Ewa... 

t  ij  ij  i  J  cuT  t  ij 

Ac  any  given  stage,  the  admissible  graph  G  =  (V,A  )  is  the  spanning 

0  0 

subgraph  of  G  containing  those  arcs  with  zero  reduced  cost,  i.e. 


( —  >  j  )  €  A 


>Ui 


+  _ 
tST 


w  aC . 
t  iJ 


=  c . 


IJ 


where  T  is  the  index  set  of  the  inequalities  included  so  far  in  L(w).  The 

inclusion  of  each  new  inequality  into  the  Lagrangean  function  adds  at  least  one 

new  arc  to  the  set  A  .  Furthermore,  as  long  as  G  is  not  stronglv  connected,  the 

procedure  is  guaranteed  to  find  a  valid  inequality  satisfying  (i)  and  (ii).  Thus 

the  number  of  arcs  in  A  steadily  grows;  and  when  no  more  inequalities  can  be 

found  that  satisfy  (i)  and  (ii) ,  G^  is  strongly  connected.  Finally,  if  at  some 

point  G  becomes  Hamiltonian  and  a  tour  H  is  found  in  G  whose  incidence 
o  o 

vector  satisfies  (37)  with  equality  for  all  tvT  such  that  >  0,  then  H  is  an 
optimal  solution  to  TSP  (see  Exercise  5). 

Three  types  of  inequalities,  indexed  by  ^  ,  T0  and  T^  ,  respectively, 
are  used  in  three  noniterative  bounding  procedures  applied  in  sequence.  We 
will  denote  the  three  components  of  w  corresponding  to  these  three  inequality 
classes,  oy  =  ('  )  —  ,  _  =  and  v*  ( v. ) .  ,  resoectiveiv, 

1-  i-  1  -  1  i  i'i.  .  1 
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Sounding  procedure  1 

This  procedure  uses  the  inequalities  (5)  satisfying  conditions  (i)  and  fii). 

For  any  S -V,  the  set  of  arcs  (S,V  S')  =  _  (i ,  j  )  • :  iiS  ,  j~V  S;  is  called  a  directed 

cutset .  The  inequalities  (5)  corresponding  to  the  node  sets  S  ,  t~T,  can  be 

reoresented  in  terms  of  the  directed  cutsets  X  =  (S  ,  V  S  ) ,  as 

t  t  t 


(i2'  I  x.  .  >  1,  t  €  T 

(i,j)€Kc1J  ‘ 

At  any  stage  of  the  procedure,  the 
is  easily  seen  to  satisfy  conditions 

(A-3>  X_  "  A  =  3  . 

c  O 


l  * 

inequality  corresponding 
(i)  and  (ii)  if  and  only 


to  cutset 
if 


To  find  a  cutset  satisfying  (43),  one  chooses  a  node  i  £  V  and 

forms  its  reachable  set  R(i)  =  <J~Vj there  is  a  directed  path  from  i  to  j }  in 
C'o .  If  R(i)  =  V,  there  is  no  cutset  with  i  (  satisfying  (43),  so  one  chooses 
another  node.  If  R(i)  *  V  for  some  i  ^  ",  then  X_  =  (R(i),  V  R(i))  satisfies  (43) , 

W 

and  the  largest  value  that  one  can  assign  to  the  corresponding  multiplier  \  with¬ 
out  violating  the  constraints  of  W  is  \  =  min  c  ..  Thus  the  ineaualitv 

(i,j)^Kt  ij 

(42)  corresponding  to  K  is  included  in  L(w)  by  setting  the  reduced  costs  to 

c.  -  c,  -  X  ,  (i,j)^X_,  c, ,  -  c, ,  otherwise.  This  adds  to  A  all  arcs  for 
lj  ij  t  t  ij  ij  o 

which  the  minimum  in  the  definition  of  X.  is  attained.  The  search  is  then  started 

again  for  a  new  cutset;  and  the  procedure  ends  when  the  reachable  set  of  every 

node  is  V.  Ac  that  stage  Gq  is  strongly  connected,  and  K^Aq  4  0  for  all 

directed  cutsets  X  in  G.  Also,  from  (41)  and  the  fact  that  aC  =  1,  “  t£T, ,  it 

o  1 

follows  that  procedure  1  improves  the  lower  bound  on  v(TSP)  by  ,  i.e.,  at 
the  end  of  procedure  1  the  lower  bound  is 


3.  =  v(A?)  -  I 

1  t-T  C 

1 


One  can  show  that  bounding  procedure  1  generates  at  most  (h-l)(h+2)/2 


cutsets,  where  h  is  the  number  of  subtours  in  x  (see  Exercise  6).  The  computa¬ 
tional  effort  required  to  find  a  cutset  satisfying  (43)  or  showing  that  none 
exists  is  0(n|A|). 

Example  2 . 

Consider  the  8-city  ISP  whose  cost  matrix  is  shown  in  Table  1. 

Table  1 


1 

2 

3 

4 

5 

6 

7 

8 

1 

X 

O 

11 

10 

8 

7 

6 

5 

2 

6 

X 

1 

8 

8 

4 

6 

7 

3 

5 

12 

X 

11 

8 

12 

3 

11 

4 

! 

9 

10 

X 

1 

9 

8 

10 

5 

ii 

11 

9 

4 

X 

2 

10 

9 

6  j 

12 

8 

5 

2 

11 

X 

11 

9 

7 

10 

11 

12 

10 

9 

12 

X 

3 

8 

7 

10 

10 

10 

6 

3 

1 

X 

Table  2  shows  the  optimal  solution  x  to  AP  (x  =  1  for  (i,j)  boxed  in, 
xij  =  0  otherwise),  the  optimal  solution  (u,v)  to  the  dual  of  AP  (the  numbers 
on  the  rim),  and  the  reduced  costs  c^ .  The  solution  value  is  17.  The  correspon¬ 
ding  admissible  graph  Gq  is  shown  in  Fig.  3. 

Bounding  procedure  1.  Cutset  =  ([l,  2,  3,  7,  8},  {4,  5,  6})  admits 
^  =  cg  6  =  2,  and  cutset  K2  =  ({4,  5,  6],  (1,  2,  3,  7,  8})  admits  3  =  3. 

The  lower  bound  becomes  17  +  2  +  3  ■  22.  The  new  reduced  cost  matrix  is  shown 
in  Table  3  and  the  corresponding  admissible  graph  Gq  in  Fig.  4.  Note  that  Gq 
of  Fig.  4  is  strongly  connected.) 
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Bounding  procedure  2 

This  procedure  uses  the  inequalities  (3)  that  satisfy  conditions  (i)  and 
vii),  i.e.  are  violated  by  x  and  admit  a  positive  multiplier.  To  write  these 
inequalities  in  the  general  form  (37),  we  restate  them  as 


i-:scjssc 


>  i  -  !s.l. 


t  €  X2  . 


The  subtour  elimination  inequalities  (3)  (or  (44))  are  known  to  be 
equivalent  to  (5)  (or  (42)).  Nevertheless,  an  inequality  (44)  may  admit  a 
positive  multiplier  when  the  corresponding  inequality  (42)  does  not,  and  vice 
versa. 

If  S^,...,S^  are  the  node  sets  of  the  h  subtours  of  x,  every  in¬ 
equality  (44)  defined  by  S  ,  t=l,...,h,  is  violated  by  x;  but  a  positive 
multiplier  can  be  applied  without  violating  the  condition  that  x  =  1 

implies  c^  =  0,  only  by  changing  the  values  of  some  u^  and  v^  ,  and  this 
in  turn  can  only  be  done  if  a  certain  condition  is  satisfied.  Roughly  speaking, 
we  have  to  find  a  set  of  rows  I  and  columns  J  such  that,  by  adding  to  each  u^ ,  irl 

and  v.,  j€j  the  same  amount  >  0  that  is  being  added  to  c.  (i,j)€(S  ,  S  ), 
j  J  t  ~  ij  J  t’  t 

we  obtain  a  new  set  of  reduced  costs  c'  such  that  c'  >  0  for  all  (i,j),  and 

ij  ij  ~ 

c/j  =  0  for  all  those  (i,j)  such  that  x^  =  1.  The  condition  for  this  is  best 
expressed  in  terms  of  the  assignment  tableau  of  the  Hungarian  algorithm  whose 
rows  and  columns  are  called  lines .  and  whose  row/column  intersections  are  called 
cells.  Cells  correspond  to  arcs  of  G  and  are  denoted  the  same  way. 


Let  be  the  node  set  of  a  subtour  of  x,  and 

•S  -  C(i,j)€A0!i,j€St:,  a;  -  f(i,j)€Ac!x  -  1] 


(y)  no  (i,j)€AoxAc  is  covered  by  any  line  in  C  . 

If  such  a  set  C  exists,  and  it  consists  of  row  set  I  and  column  set  J, 
then  the  maximum  applicable  multiplier  is 


min  c .  . 
(i,  j)€M  1J 


.mere 


::  =  sj  dc,j)  . 

Proof .  Sufficiency.  Suppose  line  set  C,  consisting  of  row  set  I  and 

column  set  J,  satisfies  (a),  (p),  (v'> .  Then  adding  >  0  to  c^.  for  all 

(i,  as  well  as  to  all  ,  i£l  and  v. ,  j€j,  produces  a  set  of  reduced 

costs  Cj*.  such  that  =  0  for  (i,j)uA^,  since  C  =  I„J  satisfies  (%).  Further, 

since  C  satisfies  (o')  and  (y) ,  cf.  '  c..  =  0  for  all  (i,j)€A^  A^,  and  c'.  =  c. .  =  0 

’  lj  -  ij  t  t’  ij  ij 

for  all  (i,j)€A  'A  .  The  only  reduced  costs  that  are  diminished  as  a  result 
of  the  above  changes,  are  those  corresponding  to  arcs  in  one  of  the  three  sets 
(I,i>,  (I,V'S  "),  (V  S  ,  J)  whose  union  is  the  set  M  of  the  theorem.  Hence 

setting  __  equal  to  the  minimum  reduced  cost  over  M  provides  a  positive  multiplier 

that  can  be  applied  to  the  arcs  in  (S^,  S^. 

Necessity.  Suppose  a  multiplier  _  >  0  can  be  applied  to  the  arc  set  (S_,  S  ^ , 
In  order  to  prevent  the  c^.  for  ( i , j ) •£ A ^  from  becoming  positive,  one  must  increase 
u.  -  v.  b”  .  for  all  <  i ,  j  'oA_' .  If  this  can  be  done,  it  can  be  done  by  adding  _ 
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1 


to  iu  or  v  (but  not  to  both)  for  (i,j)cA/;  and  the  corresponding  index  sets  I 

and  J  form  a  set  C  =  I  J  that  satisfies  (a).  Let  C  be  the  collection  of  all 

sets  C  obtained  in  this  way.  Mow  take  any  CSC.  If  C  violates  (p'i,  then 

c!.  =  c.  .  +  „  -  2u  <  c.  .  =  0  for  some  (i.j)'SA  A^,  and  if  it  violates  (>)  ,  then 
ij  ij  ij  t  t 

d.  <  c, .  =0  for  some  (i.i)SA  A,.  Since  bv  assumption  „  >  0  can  be  applied 
ij  ij  o  t  rr 

to  (St>  S^),  there  exists  at  least  one  set  C € C  that  satisfies  both  (p)  and  (>). 

To  check  whether  for  a  given  subtour-node-set  S  there  exists  a  set  of 
lines  C  satisfying  conditions  (a),  (p),  (v),  we  proceed  as  follows. 

First  we  construct  a  set  R"  of  rows  that  cannot  belong  to  C,  and  a  set  K+ 
of  columns  that  must  belong  to  C,  if  conditions  (a),  (p),  (v)  are  to  be  satisfied. 
To  do  this,  we  start  with  K+  =  0  and  in  view  of  (v),  put  into  R*  all  rows  i  for 
which  there  exists  a  cell  (i,j)dAQ  with  j^V'-.S  .  Then  we  apply  recursively  the 
following  two  steps,  until  no  more  additions  can  be  made  to  either  set: 

If  a  row  i  was  put  into  R',  then  to  satisfy  (-<)  we  put  into  K+  every  column 
j  such  that  (i,j)€A^. 

If  a  column  j  was  put  into  K+,  then  to  satisfy  (3)  we  put  into  R~  every 

row  h  such  that  (h,j)SAt. 

To  state  the  procedure  formally,  we  set  K+  =  0, 

o 


Rq  =  {iSSt|l(i,j)£Ao  with  j€V  S^, 


and  define  recursive) v  for  r  =  l,...,r. 


U  ij€St|jL(i,j)cAt'  with  1€r“  ^ } 
R;  =  R^U  fi€St|Z(i,j)€At  with  j€K+]  . 


Here  r  is  the  smallest  r  for  which  K+  =  K+  -  or  R*  =  R*  ,  . 

r  r- 1  r  r-1 
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Next  we  use  a  perfectly  analogous  procedure  to  construct  a  set  r"1"  of  rows 
that  must  belong  to  C  and  a  set  K  of  columns  that  cannot  belong  to  C,  if  (;,.■, 
(5),  ('•'>  are  to  hold.  In  other  words,  we  set  Rq  =  0, 

K~  =  {jSS ^ (Z(i , j )^Aq  with  i-V  S^j, 


and  define  recursively  for  s  =  l,...,s, 

Rg  =  Rg_i with 

K’  =  Kg_1 'J  (j€St|Z(iJj)cAt  with  i€R^} 


Here  s  =  min'S,,  s_},  where  s,  is  the  smallest  s  such  that  R  =  R  ,  or 
'  1  2  1  s  s-1 

K  =  K  ,  ,  and  s.  is  the  smallest  s  such  that  S+ '  Rl  M  or  K  ~  Kit  ^  0  . 
s  s-1  2  s  r  s  r 

If  s  =  sv  then  some  row  or  some  column  that  cannot  belong  to  C,  must  belong 
to  C  for  (3),  (v)  to  hold;  hence  there  exists  no  set  C  of  lines  satisfying 

(o'!,  (S'),  an’  no  positive  multiplier  can  be  applied  to  the  inequalitv  f A.A 3 


corresponding  to  St- 

If  s  =  s,f  then  the  set  of  lines  C  =  11 J.  where  I  =  S  R—  and  J  =  K^, 

1  t  r  r’ 

satisfies  conditions  (o),  (o),  (v).  Thus  we  include  the  inequality  (44)  corre¬ 


sponding  to  into  L(w)  with  the  multiplier  >  0  defined  in  Theorem  2,  and 


set  the  reduced  costs  to  c. .  —  c. .  -  _  ,  (i,i)vM,  c. . 

ij  lj  t’  lj 

M  is  the  set  defined  in  Theorem  2.) 


c . .  otherwise . 
ij 


(Here 


In  both  cases,  we  then  choose  another  subtour,  until  all  subtours  have  been 
examined.  If  h  is  again  the  number  of  subtours,  bounding  procedure  2  requires 
0 ( i» -  ,A;'>  steps.  It  can  be  shown  (see  Exercise  7)  that  this  procedure  improves 
the  lower  bound  on  vf'TSP'i  by  ,  i.e.,  at  the  end  of  procedure  2  the  lower 


bound  is 


Example  2  (continued! . 


Bounding  procedure  2.  The  subtours  of  x  are  (1,  2,  31,  (4,  5,  n 1  and 
(7,  Sl(see  Table  3  and  Fig.  41. 

For  S  =  '1,  2,  3),  R_  =  {3},  let  =  i'l};  K_  =  *3},  R?  =  :  2 } .  Thus  C  =  I  ,  J 
where  I  =  i  l,  2  j ,  J  =  [l],  and  =  c?  ^  =  1.  For  S,  =  [U ,  5,  6  j ,  Rq  =  (6 j , 

=  (4);  KQ  =  {6},  R^  =  ’.5j,  and  C  =  I'JJ,  with  I  =  {4 ,  5},  J  =  (4j,  and 
1  =  c5  4  =  2*  Finally,  for  =  [l ,  8},  R~  =  {8},  =  {?};  K~  =  {7},  and 

since  K.  2  =  (7}  ^  0,  the  inequality  corresponding  to  subtour  (7,  8)  does  not 

admit  a  positive  multiplier. 

The  lower  bound  becomes  B,,  =  +  p?  =  22  +  1  +  2  =  25.  The  new  reduced 

costs  are  shown  in  Table  4,  and  the  corresponding  admissible  graph  G  in  Fig.  5.,, 


Table  4 


Fig.  5.  Graph  Gq  after  bounding  procedure  2 

Bounding  procedure  3 

The  class  of  inequalities  used  in  this  procedure  is  defined  as  follows. 
Suppose  Gq  has  an  articulation  point,  i.e.  a  node  k  such  that  G  -  {k}  has 
more  than  one  component.  Let  one  of  the  components  have  node  set  S  ,  and  denote 
Wt  =  V\S  U{k).  Then  every  tour  contains  an  arc  of  at  least  one  of  the  cutsets 
Kt  =  and  Kt#  =  ^Wt’St^’  ^ence  clle  incidence  vector  x  of  any  tour 

satisfies  the  inequality 

(45)  T  x. .  >  1  . 

( i  >  j  )  1J 


Furthermore,  (45)  satisfies  condition  (i),  i.e.  is  violated  by  the  AP  solution. 
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3ounding  procedure  3  uses  those  inequalities  (451  that  also  satisfy 

condition  (ii).  Although  every  inequality  (451  is  the  combination  of  some 

inequalities  (31  and  equations  (2)  (see  Exercise  81,  nevertheless  it  is  possible 

to  find  inequalities  (451  that  satisfy  condition  (ii),  i.e.,  admit  a  positive 

multiplier,  when  no  inequality  (3)  (i.e.,  (44'»  1  satisfies  it.  Indeed,  it  is  not 

hard  to  see,  that  if  k  is  an  articulation  point  of  G  and  S  is  the  node  set  of 

o  t 

one  of  the  components  of  G  -  "kj,  then  k'~A  =  (C "  ~  A  =0  and  a  positive 

o  -  t  o  t  o  r 

multiplier  given  by 


(46) 


v„  =  mm 


(ijia^jK"  iJ 


can  be  applied  to  the  arc  set  K '  4 K "  .  On  the  other  hand,  if  G  has  no  articula- 

t  t  o 

Cion  point,  then  for  any  choice  of  the  node  k,  the  minimum  in  (46)  is  0  and  thus 

no  inequality  (45)  admits  a  positive  multiplier. 

Thus  bounding  procedure  3  checks  for  every  i€V  whether  it  is  an  articulation 

point,  and  if  so,  it  takes  the  corresponding  inequality  (45)  into  L(w)  with  the 

multiplier  v  given  by  (46).  This  is  done  by  setting  c. .  -  c. .  -  v  ,  (i , i l^K '  U K. " 

t  ij  ij  t’  ,J  t  t  ’ 

otherwise.  Since  Gq  has  n  nodes,  and  testing  for  connectivity  requires 
0(|a|)  steps,  bounding  procedure  3  requires  0(n|A|)  steps. 

In  view  of  (41)  and  the  fact  that  (45)  has  a  righthand  side  of  1,  at  the  end 
of  bounding  procedure  3  one  has  the  following  lower  bound  on  v(TSP): 


B.,  -  v(AP)  +  11  +  I!  u  +  Z  v 

t€T,  t6T„  t€T„  t 


Example  2  (continued). 

Vertex  6  is  an  articulation  point  of  Gq  (see  Pig.  51.  The  corresponding 
cutsets  are  K'  =  ({4,  5},  (l,  2,  3,  7,  8 })  and  K"  =  ( f  1 ,  2,  3,  7,  8},  (4,  5}), 


and  the  arc  set  admits  the  multiplier  =  c.  ^  =  1.  There  is  no  other 

articulation  point,  and  the  procedure  stops  with  the  lower  bound  B,  =  3  -  , 

3  2  1 

25  +  1  =  2b.  The  new  reduced  costs  are  shown  in  Table  5,  and  the  corresponding 
in  Fig.  6. 


Table  5 


x  [Oj  1  3  0  4  5 


3  j  [o]  9  x  3  2  7  0  8 


Fig.  6.  Graph  G  after  bounding  procedure  3 
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Additional  bounding  procedures 

At  the  end  of  bounding  procedure  3,  is  strongly  connected  and  wit!  '  •" 

articulation  points.  At  that  stage  an  attempt  is  made  to  find  a  tour  in  Gq. 

For  that  purpose  a  specialized  implicit  enumeration  technique  is  applied, 
with  a  cut-off  rule.  If  a  tour  H  is  found  whose  incidence  vector  x  satis- 

A 

fies  with  equality  all  those  inequalities  (37)  such  that  w  >  0,  then  H  is 
optimal  for  the  current  subproblem  (this  follows  from  elementary  Langrangean 
theory) . 

Example  2  (continued).  The  following  tour  can  be  identified  by  inspection 

in  Gq  of  Fig.  6:  H  =  f(l,  2),  (2,  3),  (3,  7), (7,  8),  (8,  6),  (6,  4),  (4,  5),  (5,  1) 

The  value  of  H  is  26,  equal  to  L(w)  =  B^,  the  lower  bound  at  the  end  of  procedure  3 

The  tour  H  contains  exactly  one  arc  of  each  cutset  associated  with  a  positive  \  , 

namely  arc  (8,  6)  of  =  ({l,  2,  3,  7,  8},  {4,  5,  6}),  and  arc  (5,  1)  of 

K9  =  ( (4 ,  5,  6},  (1,  2,  3,  7,  8]).  Thus  the  incidence  vector  of  H  satisfies 

with  equality  the  two  inequalities  (42)  corresponding  to  and  ,  as  required. 

Further,  H  contains  exactly  |S^|  -1=2  arcs  of  the  subtour  with  node  set 

=  (1,  2,  3],  namely,  (1,  2)  and  (2,  3);  and  exactly  |s„,|  -1  =  2  arcs  of  the 

subtour  with  node  set  =  (4,  5,  6},  namely  (6,  4)  and  (4,  5).  Thus  the 

complementarity  condition  is  also  satisfied  for  the  two  inequalities  (44) 

corresponding  to  and  S2.  Finally,  it  contains  exactly  one  arc  of  the  set 

K^JK",  where  =  ({4,  5],  {l,  2,  3,  7,  8]),  k"  =  (fl,  2,  3,  7,  8],  {4,  5}), 

namely  (5,  1):  so  th^  complementarity  condition  also  holds  for  the  inequal  ity 

(45)  corresponding  to  U  .  In  conclusion,  H  is  optimal.il 

*  A 

If,  after  bounding  procedure  3,  a  tour  H  is  found  such  that  x  violates  this 
complementarity  condition  for  some  t  c  T,  then  attempts  are  made  to  replace  those 


inequalities  -  3”''  chac  are  "our  of  kilter,"  i.e.,  for  which  the  complementarity 
condition  is  vioiaced,  bv  "in  kilter"  inequalities  (of  the  same  type*,  i.e., 
inecua iities  that  are  tight  for  x  and  thus  admit  positive  multipliers  satisfying 
the  complementarity  condition.  These  attempts  consist  of  a  sequence  of  three 
additional  bounding  procedures ,  called  4,  5  and  6,  one  for  each  type  of  inequality 
(42),  (44)  and  (45),  respectively.  Bounding  procedure  4  takes  in  turn  each  in¬ 
equality  (42)  which  has  a  positive  multiplier  and  yet  is  slack  for  x,  and  per¬ 
forms  an  exhaustive  search  for  other  inequalities  of  type  (42)  that  could  replace 
the  inequality  in  question  (with  new  multipliers)  and  which  are  tight  for  x.  If 
the  search  is  successful,  the  in  kilter  inequalities  with  their  new  multipliers 
replace  the  out  of  kilter  inequality  and  one  proceeds  to  the  next  out  of  kilter 
inequality  of  type  (42' .  Procedures  5  and  6  perform  the  same  function  for  out 
of  kilter  inequalities  of  type  (44)  and  (45),  respectively.  These  procedures  are 
described  in  detail  in  Balas  and  Christofides  [19811.  When  procedures  4,  5  and 
6  are  not  successful  in  replacing  all  out  of  kilter  inequalities  (and  thus  proving 
H  to  be  an  optimal  tour),  they  nevertheless  strengthen  the  lower  bound  on  v(TSP). 

Each  of  the  six  bounding  procedures  is  polynomially  bounded.  This  (worst 
4  3 

case)  bound  is  0(n  )  for  procedure  1,  0(n  )  for  each  of  the  other  procedures. 
The  mean  times  are  considerably  shorter,  and  on  the  average  procedure  2  (the 
only  one  that  changes  the  dual  variables  ^ ,  v  )  takes  the  longest.  The 
general  algorithm  of  course  remains  valid  if  any  subset  of  the  six  bounding 
procedures  is  used  in  place  of  the  full  set,  but  computational  testing  indi¬ 
cates  that  using  all  6  procedures  is  more  efficient  (i.e.  results  in  smaller 


search  trees  and  shorter  overall  computing  times)  than  using  any  proper  subset. 
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Branching  rules  and  ocher  features 

Before  branching,  ail  arcs  (i,j)  such  char  c\..  >  U  -  L(w)  are  deleted 
from  G.  This  "reduction"  has  removed  on  a  set  of  120  randomly  generated  problems 
(Balas  and  Christofides  [1981]),  on  the  average  96-977,  of  the  arcs  in  problems 
with  up  to  150  variables,  and  98%  in  problems  with  175-325  variables. 

The  AP  relaxation  with  Lagrangean  objective  function  can  of  course  be  used 
with  any  of  the  branching  rules  BR.1  -  BR5  described  in  the  context  of  the  A? 
relaxation  with  objective  function  (1).  Balas  and  Christofides  [1981]  use  two 
rules  intermittently,  namely  BR3  (partitioning  on  the  basis  of  a  subtour  elimi¬ 
nation  inequality  (3)1,  and  another  rule  based  on  a  disjunction  from  a  condi¬ 
tional  bound,  introduced  earlier  in  the  context  of  set  covering  (3alas  [198011. 
This  latter  rule  is  motivated  by  the  following  considerations. 

a 

Let  H  be  the  current  tour  and  x  its  incidence  matrix.  Remove  from  L(wl 
all  those  inequalities  (37)  that  are  slack  while  the  associated  multiplier  is 
positive.  Let  c4  be  the  reduced  costs,  and  L(w')  the  lower  bound,  resulting 
from  this  removal. 

Theorem  3.  Let  S«H,  S  =  f(i„.  i„) . (i_,  11}  be  such  that 

™ 1  I  P  ? 


(471  I  c  >  U  -  L(w), 
r=l  r^r 


and  let  the  arc  sets  Q  — A,  r  =  l,...,p,  satisfy 


2= - :-.£t  j 

r'(i,j)-Qr  Vr 


Then  ever*/  solution  x  to  TSP  such  that  cx  h  u  satisfies  the  disjunction 


■  '  4  =  3,  (  i,  j1--Q  1 

r=  1  ** 


car. 
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Proof.  L(w)  is  the  value  of  an  optimal  solution  to  the  dual  of  the  linear 
program  LP  defined  by  (1),  (2),  x  2  0,  (i,j)£A,  and  those  inequalities  (37) 
with  a  positive  multiplier.  Now  let  x  be  a  feasible  solution  to  LP  that  violates 

(49) .  Then  x  satisfies 

(50)  ^  x,  ■  >  1  ,  r  =  1 , . . . ,  p. 

(i»j)-Qr  J 


Let  LP+  be  the  linear  program  obtained  by  adding  to  LP  the  constraints  (50). 

From  (48),  if  we  assign  the  values  c.  .  ,  r  =  1 , . . . , p  to  the  dual  variables 

rJr 

associated  with  the  inequalities  (50),  we  obtain  a  feasible  solution  to  the  dual 

P 


of  LP+.  But  then  the  objective  function  value  of  this  solution  is  L(w) 
and  hence  from  (47) 


+ 


r 


P 

cx  ^  L(w)  +  I  c.  .  >  U. 

r=l  Vr  “ 

Thus  every  solution  x  to  TSP  such  that  cx  <  U  satisfies  (49) . ■  j 
The  branching  rule  can  now  be  stated  as  follows. 

BR9 .  Choose  a  minimum-cardinality  set  S£H,  S  =  ((i..,  j,),...,(i  ,  j  )}, 

1  1  P  P 

satisfying  (47).  Next  construct  a  px | A  j  0-1  matrix  D  =  (d£  )  (where  r  is  the 
row  index  and  (i,j)  the  column  index),  with  as  many  l's  in  each  column  as  possible, 
subject  to  the  condition  (48)  and  (i^,  j^jSQ^,  r  =  l,...,p,  where 

Qr  =  f(i,j)6A|d[  =  1}. 


Generate  the  p  new  subproblems  defined  by  the  disjunction  (49),  where 
the  r-t^  subproblem  is  given  by 


(51  > 


Ek+r  "  *k  U  Qr 


Xk+r  =  Xk 
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Thd  branching  rule  SR9  is  used  intermittently  with  3R3  because  ac  different 
nedes  the  ranking  of  the  wo  rules  (in  terns  of  strength)  nay  be  different.  The 
choice  is  based  on  certain  indicators  of  relative  strength. 

As  to  subproblem  selection,  the  Balas-Christofides  algorithm  uses  a  mixture 
of  depth  first  and  breadth  first:  a  successor  of  the  current  node  is  selected 
whenever  available;  otherwise  the  algorithm  chooses  a  node  k  that  minimizes 
the  function 

E(k>  '  «•<»>*  -  rsW~Vk)T  • 

where  L(w)k  is  the  value  of  L(w)  at  node  k,  v(AP)  is  the  value  of  the  initial  AP, 
while  s(0^  and  s(k)  are  the  number  of  subtours  in  the  solutions  to  the  initial 
AP  and  the  one  at  node  k,  respectively. 


5.  Other  Relaxations 


For  the  same  reasons  as  in  the  case  ot  the  AP  relaxation  with  the  original 
objective  function,  the  A?  relaxation  with  the  Lagrangean  objective  function  is 
inefficient  (weak)  in  the  case  of  the  symmetric  ISP.  Limited  computational 
experience  indicates  that  on  the  average  the  bound  L(w)  attains  about  967. 
of  v(TSP) ,  which  compares  unfavorably  with  the  bound  obtained  from  the  1-tree 
relaxation. 

On  the  other  hand,  the  main  reason  for  the  weak  performance  of  AP-based 
relaxations  in  the  case  of  symmetric  problems,  namely  the  high  frequency  of 
subtours  of  length  2  in  the  optimal  AP  solution,  can  be  eliminated  if  AP  is 
replaced  by  the  2-matching  problem  in  the  undirected  graph  G  *  (V,E). 
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The  2-natchins  relaxation 

The  problem  of  minimizing  the  function  (6)  subject  to  constraints  (7)  and 
1.9';  is  known  in  the  Literature  as  the  2-matc'ning  problem,  and  is  obviously 
a  relaxation  of  the  TSP.  Bellmore  and  Malone  [1971]  have  used  it  for  the 
symmetric  TS?  in  a  way  that  parallels  their  use  of  the  AP-relaxation  for  the 
asymmetric  TSP.  A  2-matching  is  either  a  tour  or  a  collection  of  subtours, 
and  the  branching  rules  BR2  -  3R5  based  on  the  subtour-elimination  inequalities 
(3)  and  (5)  for  the  asymmetric  TSP  have  their  exact  parallels  in  branching  rules 
based  on  the  subtour  elimination  inequalities  (8)  and  (10)  for  the  symmetric  TSP. 

The  objective  function  (6)  can  be  replaced,  just  like  in  the  case  of  the 
A?  relaxation,  with  a  Lagrangean  function  using  the  inequalities  (S)  and/or 
(10).  The  Lagrangean  dual  of  the  TSP  formulated  in  this  way  is  as  hard  to 
solve  exactly  as  in  the  asymmetric  case,  but  it  can  be  approximated  by  a  pro¬ 
cedure  similar  to  the  one  used  by  3alas  and  Christofides  [1981]  with  the 
AP-relaxation.  Further  facet  defining  inequalities,  beyond  (8)  and  (10), 
based  on  the  work  of  Grotschel  and  Padberg  [1979],  can  be  used  to  enrich  the 
set  drawn  upon  in  constructing  the  Lagrangean  function. 

Although  the  2-matching  problem  is  polynomially  solvable  (Edmonds  [1965])> 
the  main  impediment  in  the  development  of  an  efficient  branch  and  bound  proce¬ 
dure  based  on  the  2-matching  relaxation  has  so  far  been  the  absence  of  a  good 
implementation  of  a  weighted  2-matching  algorithm.  However,  as  this  difficulty 
is  likely  to  be  overcome  soon,  the  2-matching  relaxation  with  a  Lagrangean 
objective  function  will  in  all  likelihood  provide  bounds  for  the  symmetric  TS? 
conoarable  to  chose  obtained  from  the  1-tree  relaxation. 


i 
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The  n-path  relaxation 

The  problem  of  minimining  (1)  subject  to  the  constraint  that  the  solution 
be  the  incidence  matrix  of  a  directed  n-pach  starting  and  ending  at  a  fixed 
node  v  (where  "path"  is  used  in  the  sense  of  walk,  i.e.,  with  possible  repeti¬ 
tions  of  nodes,  and  n  denotes  the  length  of  the  path)  is  clearly  a  relaxation 
of  the  TSP.  An  analogous  relaxation  of  the  symmetric  IS?  can  be  formulated  in 
terms  of  n-paths  in  the  associated  undirected  graph.  Furthermore,  the  constraints 
(2)  in  the  asymmetric  case,  or  (7)  in  the  symmetric  case,  can  be  used  to  replace 
the  objective  function  (1)  or  (6),  respectively,  by  a  Lagrangean  function  of  the 
same  type  as  the  one  used  with  the  1-arborescence  and  1-tree  relaxations.  This 
family  of  relaxations  of  the  TSP  was  introduced  by  Houck,  Picard,  Quevranne  and 

Vemuganti  (19~7T.  The  (directed  or  undirected)  n-path  problems  involved  in  this 

3 

relaxation  can  be  solved  by  a  dynamic  programming  recursion  in  0(n  )  steps. 
Computational  experience  with  this  approach  seems  to  indicate  (Christofiaes  [1979], 
p.  lil'i  that  the  quality  of  the  bound  obtained  is  comparable  to  the  one  obtained  from 
the  1-arborescence  relaxation  in  the  asymmetric  case,  but  slightly  weaker  chan 
the  bound  obtained  from  the  1-tree  relaxation  in  the  symmetric  case.  Since 
solving  the  1-tree  and  1-arborescence  problems  is  computationally  cheaper  than 
solving  the  corresponding  n-path  problems,  this  latter  relaxation  seems  to  be 
dominated  (for  the  case  of  the  "pure"  TSP)  by  the  1-tree  or  1-arborescence  re¬ 
laxation.  However,  the  n-pach  relaxation  can  easily  accommodate  extra  condi¬ 
tions  which  the  1-cree  and  1-arborescence  relaxations  cannot,  and  which  often 
occur  in  problems  closely  related  to  the  TSP  (traveling  salesman  problems  with 
side  constraints  appear  in  vehicle  routing  (see  Chapter  12  of  this  book)  and 
other  tractical  contexts.) 
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*4  substantial  generalization  of  the  n-path  relaxation,  due  to  Chris  to  rides , 
. 'tr.gozz i  and  iOth  [1931;  and  called  state-space  relaxation,  has  the  same 
desirable  characteristics  of  being  able  to  easily  accommodate  side  constraints. 


Ihe  ’_?  with  cutting  olanes  as  a  relaxati. 


Excellent  computational  results  have  been  obtained  recently  by  Crowder 
and  Paaberg  [1980]  for  the  symmetric  TSP  by  a  cutting  plane/branch  and  bound 
approach.  It  applies  the  primal  simplex  method  to  the  linear  program  defined 
bv  (o) ,  (7),  x.  .  >0,  » i ,  j  ,  and  an  unspecified  subset  of  the  inequalities 
defining  the  convex  hull  of  incidence  vectors  of  tours,  generated  as  needed 
to  avoid  fractional  pivots.  The  procedure  uses  mostly  inequalities  of  the 


form  i.IO),  but  also  other  facet  inducing  inequalities  from  among  those  intro- 

•I 

ducea  by  C-rotschel  ana  padberg  [1979].  When  the  search  for  the  next  inequality 
needed  for  an  integer  pivot  fails,  the  procedure  branches.  Since  the  main 
feature  of  this  approach  is  the  identification  of  appropriate  inequalities  to 
be  added  to  the  linear  program  at  each  step,  it  is  being  reviewed  in  the 
chapter  on  cutting  plane  methods. 


6.  Performance  of  State  of  the  Art  Computer  Codes 
In  this  section  we  review  the  performance  of  some  state  of  the  art  branch 
and  bound  codes  for  the  TSP,  by  comparing  and  analysing  the  computational  results 
reported  by  the  authors  of  these  codes. 


The  asymmetric  TS? 

The  three  fastest  currently  available  computer  codes  for  the  asymmetric  TS? 
seem  to  be  those  of  3alas  and  Christofides  [1981],  Carpaneto  and  Toth  [1380] 
ini  Smith.  S r  ini -.-as  am.  and  Thompson  [1977],  to  be  designated  in  the  following  by 
CC  .  IT  ar.c  337,  respectively .  The  main  characteristics  of  these  codes  are  5u.-m.a- 
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rized  in  Table  6.  Table  7  describes  the  computational  results  reported  by  the 
authors  of  the  codes.  Each  of  the  codes  was  run  on  a  set  of  (different)  asymmet¬ 
ric  TSP's  whose  costs  were  independently  drawn  from  a  uniform  distribution  of  the 
integers  in  the  interval  [1,1000].  The  entries  of  the  table  represent  averages 
for  5  problems  (SST),  20  problems  (CT)  and  10  problems  (3C),  respectively,  in 
each  class.  The  number  of  nodes  in  the  SST  column  is  not  strictly  comparable 
with  that  in  the  CT  and  BC  columns,  since  it  is  based  on  counting  only  those 
nodes  that  were  selected  for  branching  and  processed.  Also,  the  computing  times 
are  not  strictly  comparable  without  a  correction,  since  the  CDC  7600  is  about  3 
times  faster  than  the  Uh'IVAt  1108  and  the  CDC  6600  (Computer  Review,  GML  Corp., 
Lexington,  MA,  1979'!.  The  picture  that  emerges,  however,  by  comparing  the 
figures  wi thin  each  column,  for  any  of  these  three  codes,  is  a  pattern  of  growth 
in  computational  effort  with  problem  size,  that  seems  rather  modest  for  a  problem 
usually  viewed  as  "notoriously  intractable".  We  will  discuss  the  functional 
relationship  between  problem  size  and  computational  effort  in  some  detail  further 
below. 

For  problems  in  the  range  40  n  4  180,  the  number  of  nodes  generated  by 
the  BC  algorithm  is  considerably  smaller  than  the  corresponding  numbers  for  the 
other  two  algorithms,  although  CT  uses  a  "breadth  first"  branching  strategy,  meant 
to  minimize  the  number  of  nodes  generated,  at  the  cost  of  increased  storage 
requirements.  The  reason  for  this  is  that  the  Lagrangean  bounding  function  used 
by  BC  changes  the  ranking  of  tours  among  the  assignments,  removing  from  considera¬ 
tion  many  assignment  problems  whose  value  in  terms  of  the  original  objective 
function  is  higher  than  that  of  the  optimal  TSP,  and  which  therefore  must  be 
processed  by  the  CT  algorithm.  On  the  other  hand,  in  the  range  200  _  n  _  240, 
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3C  seems  to  generate  more  nodes  chan  CT ;  the  reason  for  rhis  mav  be  chat  ac 
this  point  the  advantage  of  the  ''breadth  first"  strategy  used  bv  CT  outveizhs 
that  of  the  stronger  bounding  procedures  used  by  3C .  This  seems  to  suggest  that 
an  algorithm  based  on  the  Lagrangean  bounding  procedures  of  3C,  but  using  the 
"breadth  first"  node  selection  strategy  of  CT,  will  generate  fewer  nodes  for 
any  problem  size,  than  either  the  CT  or  the  BC  algorithms.  This  is  undoubtedly 
true,  but  notice  that  at  the  current  state  of  the  art,  the  limiting  factor  in 
the  use  of  both  algorithms  is  not  computing  time  (which  has  never  exceeded  1.5 
minutes  for  any  problem'),  but  (in  core')  storage  space. 

The  S"mr.e trie  TS? 

ihe  fastest  currently  available  branch  and  bound  codes  for  the  svrnmecric  TS? 
se.m  to  be  those  of  Smith  and  Thompson  [1977]  and  Volgenant  and  Jenker  [1982], 
to  be  designated  in  the  following  by  ST  and  7J,  respectively.  Table  8  summa¬ 
rizes  their  main  characteristics,  while  Table  9  reports  on  their  computational 
performance . 

Again,  each  cf  the  two  codes  was  run  on  a  sec  of  (different)  symmetric 
ISP’s  whose  costs  were  independently  drawn  from  a  uniform  distribution  of  the 
integers  in  the  interval  [1,1000],  The  entries  of  the  table  represent  averages 
for  15  problems  (except  for  n=80,  where  the  entry  for  SST  is  the  average  for 
5  problems  only).  The  CYBER  750  is  about  3  times  faster  than  the  UNIVAC  1108. 

3och  codes  were  also  tested  on  randomly  generated  symmetric  Euclidean 
TSr's,  which  required  for  each  code  a  greater  computational  effort  (e.g..  for 
n  =  50  the  average  number  of  subgradient  iterations  was  3049  for  ST  and 
123-  for  •.•.;■) . 


Table  A.  Sunnarv  descriorion  of  :wo  codes  for  the  s’-tcnecric  TSP 
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Lagrangean  objective 
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subgradient 
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Table  9.  Computational  results  on  randomly  generated  symmetric  ISP's 


22.1  j 

3*1.1  j  1.7 

61.5  j 
S3 .0  !  15.: 


li  Lumber  of  nodes  that  were  explored:  (2)  not  reported;  i,3)  UNIT AC  1  LOS ; 
1)  CY3ZR  T50. 
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Average  performance  as  a  function  of  problem  size 

TSP  is  well  known  Co  be  MP-complete ,  hence  in  all  likelihood  there  is  no 
polynomial  time  L'SP  algorithm,  i.o.  no  algorithm  guaranteed  to  solve  every  instance 
of  TSP  in  a  number  of  steps  polynomial  in  n.  However,  this  statement  refers  to 
the  worst  case  behavior  of  algorithms,  and  does  not  exclude  the  existence  of 
algorithms  whose  performance,  though  exponential  in  the  worst  case,  is  on  the 
average  polynomial  in  n.  To  make  the  colloquial  term  "on  the  average"  more 
precise,  assume  the  costs  c„  of  TSP  are  random  numbers  drawn  inde¬ 
pendently  from  a  uniform  distribution  over  the  unit  interval.  Whether  the 
expected  time  required  to  solve  such  a  problem  is  an  exponential  or  polynomial 
function  of  n,  is  at  present  an  open  question,  on  which  the  opinion  of  experts 
is  divided  (see,  for  instance,  Bellmore  and  Malone  [1971],  and  Lenstra  and 
Rinnoov  Kan  [1978]). 

While  the  theoretical  issue  remains  unsolved,  it  is  not  irrelevant  to 
examine  from  this  point  of  view  the  empirical  performance  of  some  of  the  more 
efficient  algorithms  on  randomly  generated  TSP's.  In  a  recent  study,  Balas, 

McGuire  and  Toth  [1983]  have  fitted  three  different  approximating  curves  to  the 
data  of  Table  7  for  each  of  the  three  codes  SST,  CT  and  BC  for  the  asymmetric 
TSP,  in  an  attempt  to  determine  which  of  the  three  types  of  functions  describes 
best  the  behavior  of  each  algorithm.  The  data  of  Table  7  were  corrected  for  the 
difference  in  speed  between  the  CDC  7600  and  the  other  two  computers  by  multiplying 
by  3  the  computing  times  reported  for  the  Balas-Christofides  code.  The  functions 
examined  were: 


f  (n) 
f  (n) 


=  an 

=  an 


e 

Slogn 


f  <  n  i 


(polynomial ) , 
CsuperpolvnomiaT  . 
(exponential')  , 


where  log  stands  for  the  natural  logarithm  and  e  for  its  base. 
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Each  of  Che  three  functions  was  expressed  in  logarithmic  form,  and  a  simple 

regression  of  log  f(n)  was  :r-n  on  log  n  (in  the  case  of  the  polynomial  function), 
•1 

on  log"n  (in  the  case  of  the  superpolynomial  function),  and  on  n  (in  case  of 
the  exponential  function),  in  order  to  find  the  best  fitting  values  of  a  and  5 
for  each  case.  The  outcome  is  shown  in  Tables  10,  11  and  12. 

Table  10.  Statistical  analysis  of  the  Sraith-Srinivasan-Thompson  algorithm 


50  <  n  <  180 


'  Type  of 
function 

Best  fit  | 

Standard  j 

error  of 
es  timation 

Coefficient  of 
determination 

Polynomial 

} 

0.38  x  10'5  x  n3"473 

|  0.505 

1 

0.883 

i 

l  Superpolynomial 

0.105  x  10_1  x  n°-3771°8n 

0.519 

0.877 

I 

i  Exponential 

1.19  x  e°'0326n 

0.595 

0.838 

Table  11.  Statistical  analysis  of  the  Carpaneto-Toth  algorithm 


40  <  n  <  240 


r 

1 

j  Type  of 
j  function 

Best  fit 

Standard 
error  of 
estimation 

i 

Coefficient  of 
determination 

j  Polynomial 

0.26  x  10“3  x  n2'261 

■  ; 

0.193 

0.978 

Superpolynomial 

0.47  X  W1  x 

: 

0.255 

, 

0.962  j 

j  Exponential 

1.05  x  e°-0184n 

0.488 

0.860  | 
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Table  12.  Statistical  analysis  of  the  Balas-Chris tofides  algorithm 


50  '  n  325 


Type  of 
function 

Best  fit 

S  tandard 
error  of 
estimation 

-  ■  ’  ■  —  i 

Coefficient  of 
determination 

3. 114 

1 

Polynomial 

0.5  x  10  x 

1 

0.361 

0.962 

Superpolynomial 

0.87  x  10-3  x 

R0. 3201ogn 

0.260 

1 

1 

0.980 

Exponential 

0.85  X  10-1  x 

0.0205n 

e 

0.199 

1  0.989  1 

These  results 

suggest  that  in 

the  limited 

range  of  n  for 

which  the 

algorithms  were  tested  (40  <  n  <  180  for  SST,  40  <  n  <  240  for  CT,  and  50  n  _  325 
for  BC) ,  their  behavior  can  be  almost  equally  well  described  by  any  of  the  three 
types  of  functions  considered.  Although  the  rankings  given  by  the  coefficient  of 
determination  seem  to  be  polynomial/ superpolynomial/ exponential  for  SST  and  CT, 
versus  exponential/superpolynomial/polynomial  for  BC,  the  differences  between 
the  coefficients  of  determination  for  the  three  function  types  are  too  small  in 
comparison  to  the  differences  between  the  same  coefficients  for  the  different 
algorithms,  in  order  to  attach  much  significance  to  these  rankings.  Further  caution 
and  reservations  are  in  order  because  of  the  considerable  differences  in  the 
range  of  n  over  which  the  three  codes  were  tested. 

In  an  attempt  to  obtain  a  more  meaningful  ranking  of  the  three  types  of 
approximation  curves,  the  range  of  n  for  each  of  the  three  algorithms  was  then 
broken  up  into  two  approximately  equal  parts,  and  the  same  three  function  types 
were  fitted  separately  to  the  data  in  the  lower  half  and  in  the  upper  half  of 
the  range  of  n.  The  results,  shown  in  Tables  13,  14,  15,  yield  the  same  rankings 


m 
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Table  13.  The  Smith-Srinivasan-Thompson  algorithm,  with 


splitting  of  the  range  of 

n 

Type  of 
function 

Best  fit 

S  tandard 
error  of 
estimation 

Coefficient  of 
determination 

5°  <  n  £  110 

i 

Polynomial 

0.22  X  10'5  x  n3,64;> 

|  0.582 

1 

0. 748 

Super polynomial 

0.58  x  10'2  x  n°-4191°gn 

0.599 

0.732 

Exponential 

0.368  X  e°-0457n 

0.664 

0.672 

120  £  n  <  180 

Polynomial 

0.14  x  10‘4  x  n3*243 

0.5183 

0.400 

Super polynomial 

0.43  x  10"1  x  n°-3271°8n 

1 

0.5155  , 

0.406 

Exponential 

4.48  x  *0-0225" 

0.5041 

0.432 

Table  14. 

The  Carpaneto-Toth  algorithm,  with 
splitting  of  the  range  of  n 

Type  of 
function 

Best  fit 

Standard 
error  of 

es  timation 

- 

Coefficient  of 
determination 

Polynomial 

40  £  n  <  120 

0.45  X  10-4  x  n2*689 

0.116 

0.990 

Super polynomial 

0.12  x  10_1  x  n°-3171°Sn 

0.120 

0.989 

Exponential 

0.0364n 

0.33  x  e 

0.237 

0.959 

Polynomial 

140  <  n  <  240 

0.22  X  ID'1  x  n1-406 

0.141 

0.698 

Super polynomial 

0.9  x  n°'1341°8n 

0.138 

0.708 

Exponential 

9.0  x  e° • 0073n 

0.128 

0.749 

I 
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Table  15.  The  Ba  las -Chris tof ides  algorithm,  with 
splitting  of  the  range  of  n 


S  tandard 

Type  of  error  of  Coefficient  of 

function  Best  fit  estimation  determination 


50  <  n  <  175 


Polynomial 

0.14  x  lO-4  x  n2'407 

0.288 

0.937 

Superpolynomial 

0.74  X  lO’2  x  n0-2671ogn 

0.242 

0.956 

Exponential 

0.54  x  10-1  x  n°-0245n 

0.120 

_ 

0.989 

200  <  n  <  325 

! 

j  Polynomial 

„  c  -1  4.379 

0.5  x  10  x  n 

0.100 

0.984 

Superpolynomial 

0.11  x  10‘3  x  n0'3951°8n 

0.095 

0.986 

Exponential 

0.199  x  e°'0170n 

0.087 

0.988 

as  before  for  the  lower  half  of  the  range  of  n,  but  almost  completely  reverse 
the  rankings  for  the  upper  half  of  the  range:  ignoring  differences  of  less  than 
0.01  in  the  coefficient  of  determination,  the  exponential  function  ranks  first 
over  this  range  for  both  the  SST  and  CT  algorithms,  with  the  polynomial  and 
super polynomial  functions  tied  for  second  place;  whereas  for  the  BC  algorithm, 
all  three  functions  are  now  tied.  To  the  cautionary  note  voiced  earlier,  we 
should  now  add  the  fact  that  the  coefficient  of  determination  for  this  range  of 
n  (i.e.,  the  upper  half)  is  considerably  weaker  for  SST  (0.40-0.43)  and  CT  (0.70-0.75) 
than  for  the  full  range  of  n,  while  for  BC  it  is  about  the  same,  i.e.,  rather 
strong  (0.98-0.99).  The  findings  listed  above  are  supported  by  additional  statis¬ 
tical  evidence,  for  which,  as  well  as  for  the  methodological  details  of  the 
analysis,  the  reader  is  referred  to  Balas,  McGuire  and  Toth  [19831. 

The  conclusions  that  we  draw  from  this  statistical  analysis  are  as  follows. 
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First,  over  the  limited  range  of  n  for  which  data  are  available,  the  per¬ 
formance  of  the  three  algorithms  analyzed  C3n  be  described  almost  equally  well 
by  each  of  the  three  function  types  considered:  polynomial,  superpolynomial  and 
exponential.  Second,  while  the  best  fitting  polynomial  functions  are  of  a  moderate 
degree  (ranging  between  1.4  and  4.4),  the  best  fitting  exponential  functions  have 
a  base  very  close  to  1  (ranging  between  e0,(-)‘*6  _  ^079  and  e0-007  _  Note 

that  an  exponential  function  of  this  type  is  very  different  from  the  function  en. 
While  the  value  of  the  latter  increases  more  than  twice  whenever  the  variable  goes 
from  n  to  n  +  1,  the  value  of  1.012n  increases  only  by  1.2  percent  when  the  variable 
goes  from  n  to  n  +  1. 
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EXERCISES 

1.  Show  chac,  if  the  relaxation  R  of  TSP  used  in  the  branch  and  bound 

procedure  of  section  1  (either  version)  has  a  finite  solution  set,  and  the 
branching  rule  is  such  that  at  every  node  i  at  least  one  solution  to  the  re¬ 
laxed  problem  R.  becomes  infeasible  for  all  the  successor  Droblems  R . R 

1  ll  iq 

then  the  procedure  is  finite.  For  the  rooted  tree  representation  of  the  branch 
and  bound  procedure  discussed  in  section  1,  what  is  the  maximum  depth  of  the 
tree,  i.e.,  the  maximum  length  of  a  path  joining  any  node  to  the  root?  Give 
a  bound  on  the  number  of  nodes  of  the  rooted  tree. 

2.  Let  x*  =  (x*J  be  an  optimal  solution  to  the  assignment  problem  AP, 

and  let  AP^  be  the  assignment  problem  obtained  from  AP  by  adding  the  constraint 

x.  .  =0  for  some  (i  ,  j  )  such  that  x*  .  =1.  Describe  a  version  of  the 

11  o  Jo  l  j 

oJo  oo 

Hungarian  method  that  starts  with  x*  and  finds  an  optimal  solution  to  AP^  in 
2 

0(n  )  steps.  (Hint:  show  that  only  one  labeling  is  required.) 

3.  Show  that  the  "breadth  first"  rule  of  always  choosing  the  node  with 
the  best  lower  bound  produces  a  search  tree  with  a  minimum  number  of  nodes, 
if  (i)  every  node  selection  is  uniquely  determined,  i.e.,  there  are  no  ties 
for  the  best  lower  bound;  and  (ii)  the  branching  and  bounding  at  any  given 
node  is  done  only  on  the  basis  of  information  generated  on  the  path  from  the 
root  of  the  tree  to  the  given  node.  Construct  examples  to  show  that  neither 
(i)  nor  (ii)  is  sufficient  by  itself  (Fox,  Lenstra,  Rinnoov  Kan  and  Schrage 
[1978] ).  Assuming  that  conditions  (i),  (ii)  are  not  satisfied,  describe  a 
subproblem  selection  rule  that  combines  some  of  the  advantages  of  "breadth 
first"  with  some  of  those  of  "depth  first"  (Forrest,  Hirst  and  Tomlin  [1974), 
Balas  [  1975]  ) . 
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4.  A  subgradient  of  a  convex  function  f(x)  at  x  =  x  is  a  vector  s 


sue.  that 


k  k 

fix''  -  f(x  !  _  s(x-x  )  ,  ¥■  x, 

If 

and  the  subdifferential  of  f(x)  at  x  is  the  set  of  all  subgradients  of 

...  k 

r(x)  at  x  =  x  . 

Let  2”  be  the  family  of  1-trees  in  G  =  (V,E)  introduced  in  section  3, 

and  let  X(T)  denote  the  set  of  incidence  vectors  of  1-trees.  Show  that,  if 

1c 

H ( X  1  Is  a  1-tree  whose  incidence  vector  x  minimizes  the  function 


I  I 
i£v  j  ^  i 


+  X^)x.  . 
J  iJ 


*  lc 

cn  X(«i),  and  d^  is  the  degree  of  node  i  in  H(\  ),  then  the  n-vector  whose 

lc  .  lc 

components  are  d^  -  2,  i€V,  is  a  subgradient  of  L(X)  at  4  •  Identify  the 

subdifferential  of  L(X)  at  X  . 

5.  Let  Gq  =  (V,Aq)  be  the  admissible  graph  defined  in  section  4  with 

respect  to  (u,  v,  w) ,  and  let  x  be  the  incidence  vector  of  a  tour  H(x)  in  G  . 

o 

Show  that  H(x)  is  an  optimal  tour  in  G  if  x  satisfies  inequality  (37)  with 
equality  for  all  t“T  such  that  wt  >  0.  Is  this  sufficient  condition  also 
necessary?  (H.ut:  use  the  optimality  conditions  for  the  linear  program  de¬ 
fined  by  (1),  (2),  (37)  and  x  >  0,  ¥■  i,j,  and  its  dual.) 

6.  Show  that  bounding  procedure  1  of  section  4  generates  at  most 
(h-l)(h+2)/2  cutsets,  where  h  is  the  number  of  subtours  in  the  optimal  solu¬ 
tion  x  to  AP.  (Hint:  use  the  following  facts:  (i)  any  node  of  a  strongly 
connected  component,  hence  of  a  subtour,  is  reachable  from  any  other  node; 

(ii)  every  directed  cutset  that  is  generated  adds  to  Aq  at  least  one  new 
arc  joining  some  subtour  to  some  other  subtour;  and  (iii)  when  two  subtours 

are  joined  by  arcs  in  both  directions,  they  form  a  strongly  connected  component.) 
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7.  Show  chat,  if  is  the  lower  bound  on  vCTSP'i  obtained  bv  bounding 
procedure  1,  the  lower  bound  generated  by  bounding  procedure  2  is 


B„  =  B, 


f=T„ 


(Hint:  use  the  fact  that  if  the  cost  of  each  arc  in  the  set  _  (S  .  S  1 

t-T„  C 


is  increased  by  ^  ,  then  the  value  of  the  solution  x  (hence  of  the  solution 


(u,v)  to  the  dual  of  AP  obtained  at  the  end  of  procedure  2)  is  v(AP)  +  r  js  | ,_  . 

tiT  1  1 


8.  Let  k  be  an  articulation  point  of  the  admissible  graph  G  ,  let  S  be 

o  t 


the  node  set  of  one  of  the  components  of  G  and  consider  the  two  directed 


cutsets 


Kt'  =  £st,  V  k"  =  :v  stL  ,>},  st}. 

Show  that  the  inequality 
-  ~x.  >  1 

is  the  sum  of  the  inequality  (5)  for  S  =  S ^  'J  fk},  the  inequality  (3)  for  S  =  St> 
and  the  equations  (21  for  all  i€St  and  j€Sfc. 

9.  Formulate  the  n-path  relaxation  of  the  TSP  discussed  in  section  5  for 
both  the  asymmetric  and  the  symmetric  cases,  with  a  Lagrangean  function  involving 
the  equations  (2)  (in  the  asymmetric  case)  or  (7)  (in  the  symmetric  case).  Give 
some  examples  of  side  constraints,  i.e.,  extra  conditions,  that  this  relaxation 
of  TSP  can  accommodate  but  the  1-arborescence  or  1-tree  relaxations  can  not. 
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J  This  paper  reviews  the  state  of  the  art  in  enumerative  solution  methods  for  thej 

j  traveling  salesman  problem  (TSP).  The  introHuction  (Section  1)  discusses  the  J 

j  main  ingredients  of  branch  and  bound  methods  for  the  TSP.  S  *ctions  2,  3,  and 
4  discuss  classes  of  methods  based  on  three  different  relaxations  of  the  TSP: 
the  assignment  problem  with  the  TSP  cost  function,  the  l*tree  problem  with  a  j 
Lagrangean  objective  function,  and  the  assignment  problem  with  a  Lagrangean  j 
objective  function.  Section  5  briefly  reviews  some  other  relaxations  of  the  J 

!  TSP,  while  Section  6  discusses  the  perf awiawt  qt  some  statre-of  the  art  ! 


computer  codes.  Besides  material  from  the  literature,  the  paper  also  includes 
the  results  and  statistical  analysis  of  some  computational  experiments  designed 
for  the  purposes  of  this  review. 


